This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not hmited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



1 5 2004 




IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Attorney Docket No.: 40383-0006 



Applicant: 



Richard Bruce BRANDON et al. 



Confirmation No.: 



Appl. No.: 



10/712,266 



Examiner: Unassigned 



Filing Date: 



November 14, 2003 



Art Unit: Unassigned 



Title: 



STATUS DETERMINATION 



CLAIM FOR CONVENTION PRIORTTY 



Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 



Sir: 



The benefit of the filing date of the following prior foreign application filed in the 
following foreign country is hereby requested, and the right of priority provided in 35 U.S.C. 
§ 1 19 is hereby claimed. In support of this claim, filed herewith is a certified copy of said 
original foreign application: 

Australian Patent Application No. 2002952696 filed November 14, 2002 




HELLER EHRMAN WHITE & 
MCAULIFFE 




1666 K Street, N.W., Suite 300 
Washington, DC 20006 



Attorney for Applicant 
Registration No. 31,298 
Customer No. 26633 



Telephone: (202)912-2000 
Facsimile: (202) 912-2020 



Patent OfTice 
Canberra 



I, JANENE PEISKER, TEAM LEADER EXAMINATION SUPPORT AND 
SALES hereby certify that annexed is a true copy of the Provisional specification 
in connection with Application No. 2002952696 for a patent by GENOMICS 
RESEARCH PARTNERS PTY LTD as filed on 14 November 2002. 



WITNESS my hand this 




JANENE PEISKER 

TEAM LEADER EXAMINATION 

SUPPORT AND .SAT .RS 



r 



14-n-02;22r52 JOAVIES COLLISON CAVE Pat.&Trad 



;61 7 3368 2262 



# 4/110 



Regulalion 3.2 



AUSTRALIA 
Patents Act 1990 

PROVISIONAL SPECIFICATION 
for the invenlion entitled: 

^Status Determination" 

The invention is described in the foUowing statement: 



14-11'02;22 1 52 ;OAViES COLLISON CAVE Pat,8iTrad 



;61 7 3368 2262 



5/110 



1- 



gTATUS DETERMINATION 
Background of the Invention 

The present invention relates to a method and appaiatus for determining the status of a 
5 subject, and in particular for detcnnining the ability of a subject such as a boise to compete 
in a sporting and/or raciiig event. 

Description of Uie Prior Art 

The reference to any prior art in this specification is not, and should not be taken as, an 
10 acknowledgement or any form of suggestion that the prior art forms part of the common 
g^eral knowledge in Australia. 

A condition of a performance animal, for example a racehorse, may typically be 
determined by conventional means such as a blood piofQe test and clinical appraisal. 
15 However, these tests are of limited value because a correlation between results of a blood 
profae test or clinical appraisal and a condition or state of a performance animal is 
minimal. 



A blood profile test may be suitable for providing some information in relation to an 
20 animal that is clinically diseased or ill, but is rarely suitable for determining fitness to 
perform of an animal, particularly if the animal is healthy according to use of current 
dinical appraisal methods, and particularly if the animal cannot communicate information 
about its condition. Although blood profile tests are relatively inexpensive and easy to 

perfoon, they do not provide assessment of a wide range of conditions, correlations 
25 between test results and conditions of performance animals are poor, are limited to 

assessment of a few diseases, and arc sometimes only useful in assessment of advanced 

stages of disease where dinical intervention is too late to prevent significant loss of 

perifonnance. 



30 



Alternative diagnosis or assessment procedures are often complex , invasive, inconvenient, 
expensive, time consuming, may expose an animal to risk of injury from the procedure. 
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and often teqiiire tianspoit of the animal to a diagnostic centre. 

A final report of the results of a blood test to an end usor, eg. a trainer, often requires 
invoivemeni of rnultiple parties each providing separate input to the report. For example, a 
5 veterinarian may collect a blood sample flic sanq>le is transported or sent to a laboiatory 
for analysis, personnel in the laboratory perform an analysis using machinery on the blood 
sample, automated results from the analysis, with or without a veterinary pathologist 
interpretation, are returned to the veterinarian who then interprets the results and provides 
a s^rate report to the trainer. The process is laborious, time consuming, sul»ject to error 
10 and interpretation bias and may or may not contain infonnation relevant to the end user. 

Bioinformatics may be used with genetic based diagnosis of an animal's health. 

Currenfly, it is known to use genetic information in determining information rc^rding an 
IS indhridual. ITiis can be achieved in a number of ways <&^ding on the information that is 
desired 

Thus, for example* WO 01/25473 describes a method of characterising a biological 
condition or ag^ using calibrated gene expression profiles. In this case, when a subject is 
20 suspected of having a condition, a test is performed to obtain a specific profile, which is 
then analysed. In particular, flie collected profile is compared to a predetermined profUe to 
determine if the condition has been correctly identified. However, this suffers from 
drawbacks in that a preliminary diagnosis is required to aUow the correct test to be 
performed. 

25 

US 6,287,254 describes a system that allows users to perform DNA genetic profiling to 
determine the susceptibility of a subject to a condition. Jn particular, in this example, the 
subject is profiled to determine the presence of predetermined genes, which in turn indicate 
the susceptibility of a subject to a respective condition. Again, this requires specific tests 
30 for specific conditions, and only allows the susceptibility of a subject to be determined. 
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Summary of the Present Invention 

In a first broad form the present invention provides a method of determining the status of a 
subject^ the method including: 

a) Obtaining subject data, the subject data including one or more parameter values, at 
5 least one of the parameters being indicative of the current biological status of the 

subject; 

b) Conqparing the subject data to predetermined data, the predetermined data 
including for each of a number of individuals: 

i) One or more values for the parameters for the respective individual, at least 
^0 of the individuals having a number of conditions relevant to the status of 

the individua], the number of parameters being statistically sufficient to 
distinguish each of the conditions; and» 

ii) An indication of the status of the respective individual; and, 

c) Determining the status of the subject in accordance with the results of the 
15 comparison. 



The number of parameters is typically greater than about 100. and prefBrably between 
about 1000 and about 6000, As used herein, the term ""abouf refers to values (e^g., 
amounts, concentrations, time etc) that vary by as much as 30%, preferably by as much as 
20 20%, more preferably by as much as 10%, even more preferably by as much as 5%, and 
still even more preferably by as much as 1% to a ^)eclfied or reference value. 

The method typicsdLy includes determining any conditions displayed by the user. 

25 The method can also include determining the ability of Uie subject to perform in a sporting 
and/or racing event in accordance with any determined conditions. 

The method of performing the comparison usually includes: 

a) Obtaining a set of templates, the set of templates representing differences between 
30 groups of individuals; and, 

b) Using tiie templates to classi fy the subject data into a respective one of the groups. 
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The method may further include determining one or more conditions displayed by the 
subject in accordance with the determined group. 

5 

The parameters can be representative of the level or abundance of an agent in the subject 
or in a biological sample obtained from the subject. The agent may be selected from one or 
more of: 

a) A nucleic acid molecule; 
10 b) A proteinaceovs molecule; 

c) A carbohydrate; 

d) A lipid; 

e) A drug; 

f) A chemical; 
15 g) A gas; 

h) A cell; 

]) A pathogenic organism; and, 
j) A nonpathogenic organism. 

20 By "obtained froni* is meant that a sample such as, for example, a nucltic acid extract or 
polypeptide extract is isolated from, or derived from, a particular source* For example, the 
extract may be isolated directly from a tissue or a biological fluid isolated directly froni the 
subject 

25 Other parameters can be measured however, such as the near IR of the subject^s blood, 
general measurements^ such as temperatiu:e, or other biological indicators. 

The method usually includes: 

a) Receiving confirmation of the determined status; and, 
30 b) Updating the predetermined data in accordance with the confirmed status and the 
subject data. 



Hi 
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The predetermined data can indiide phcnotypic information of the individuals^ and the 
subject data can include phcnotypic information regarding the subject^ the phenotypic 
information including details of one or more phenotypic traits. 

5 

In this case, the method can include comparing the subject data to predetermined data for 
individuals having one or more phenotypic traits in common with the subject. 

In a second broad form the present invention provides apparatus for determining the status 
10 of a subject, the apparatus including a processing system adapted to: 

a) Obtain subject data, the subject data including one or more parameter values, at 
least one of the parameters being indicative of the current biological status of the 
subject; 

b) Compare the subject data to predetermined data, the predetermined data including 
15 for eadi of a number of individuals: 

i) One or more values for the parameters for the respective individual, at least 
some of the individuals having a number of conditions relevant to the status of 
the individual^ the number of parameters being statistically sufBcient to 
distinguish each of the conditions; and, 
20 ii) An indication of the sUtus of the le^ective individual; and, 

c) Determine the status of the subject in accordance with the results of the 
comparison. 

The apparatus can ther^ore be ad^tcd to perform the method of the first broad form of 
25 the invention. 

In a third broad form the present invention provides a computer program product for 
determimng the status of a subject, the computer program product including computer 
executable code which when executed on a suitable processing system causes the 
30 processing system to perform the method of the first broad form of the invention. 
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In a fourth broad form the present invention provides a method of allowing a user to 
determine the status of a subject, the method including: 

a) Receiving subject data from the user via a commum'caiions network, the subject 
data including one or mote parameter values, at least one of the parameter being 

5 indicative of the current biological status of the subject; 

b) Causing the base station to: 

i) Compare the subject data to predetermined data, the predetermined data 
including for each of a number of individuals: 

(1) One or more parameter values for the respective individual; and, 
10 (2) An indication of the status of each individual; and, 

ii) Determine the status of the subject in accordance with the results of the 
comparison; and^ 

c) Transferring an indication of the status of the subject to the user via the 
communications network. 



15 



20 



The method generally includes: 

a) Having the user determine the subject data using a remote end station; and, 

b) Transferring the subject data from the end station to the base station via the 
communications network. 



The base station can include first and second processing systems, in which case the method 
can indude: 

a) Ttensfening the subject data to the first processing system; 

b) Transferring the subject data to the second proc^siug system; and, 
25 c) Caudrig the second processing system to perform the comparison. 

The method may also include: 

a) Transferring the results of the comparison to the first processing system; and, 

b) Causing the first proocssit\g system to determine the status of the subject 

30 



In this case, the method preferably includes at least one of: 
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a) Transferring the subject data between the communications network and the first 
processing system through a first firewall; and, 

b) Transferring the subject data between the first and the second processing systems 
through a second firewall. 

5 

The second processing system may be coupled to a database adapted to store the 
predeteimined data, the method including: 

a) Querying the database to obtain at least selected predetermined data from the 
database; and^ 

10 b) Compare the selected predetermined data to the subject data. 

The second processing system can be coupled to a subject database, the method including 
storiqg the subject data in the subject database. 

It is also possible to implcmem any one of the features of the first broad form of the 
invention. Thus, for example, the status may include details of any conditions of the 
individuals, in which case the method can include determining any conditions displayed by 
the user. The method may also include detexmining the ability of the subject to perfomi in 
a sporting and/or racing event in accordance with any determined conditions. 

Similarly, the method of performing the comparison typically includes causing the second 
processing system to: 

a) Obtain a set of templates, the set of templates representing differences betwera 
groups of individuals; and, 

b) Use the templates to classify the subject data into a respective one of the groups. 

The method can include having the user determine the subject data using a secure array, 
the secure array having a number of features each located at respective position on the 
anay, and a reflective serial number. In this esse, the method typically includes causing 
30 the base station to: 

a) Determine the serial number from the subject data; 
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b) Detennine a layout indicating the position of each feature on the array; 

c) Determining the parameter values io accordance with the determined layout, and 
the subject data. 

5 The method may also include: 

a) Receiving confirmation of the determijied status jfrom the user; and, 

b) Updating the predetermined daia in accordance with the confirmed status and the 
subject data. 

10 In this case^ the features can include at least one of: 

a) An oligonucleotide; 

b) Ap^tldc; 

c) An antibody; 

d) A carbohydrate; 
15 e) A lipid; 

f) A cell; and, 

g) An organism. 

The method can also include causing the base station to: 
20 a) Determine payment information, the payment information representing the 
provision of payment by the user; and, 
b) Perform the comparison in response to the determination of the payment 
information. 

25 In a fifth broad forin the present invention provides a base station fcr determining the 
status of a subject, the base station indudixig: 

a) A store method for storing predetennined data, the predetermined data including 
for each of a number of individuals: 

i) One or more parameter values, at least one of the parameters being indicative of 
30 the cuirent biological status of the individual; 

ii) Anindicationof the status of the individual; and. 
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a) Obtain a set of templates, the set of templates representing differences between 
groups of individuals; and, 

b) Use the templates to classify the sul^cct data into a respective one of the groups. 

5 The method may further include determining one or mote conditions displayed by the 
subrject in accordance with the determined group. 



The subject data may be determined using a secure airay, the secure array having a number 
10 of features each located at i«spective position on the array, and a respective serial number, 
the processing system being adapted to: 

a) Determine the serial number from the subject data; 

b) Determine a layout indicating the position of each feature on the array; 

c) Determining the parameter values in accordance with the determined layout, and 
IS the subject data. 

The processing system can be adapted to: 

a) Receive confirmation of the determined ability; and, 

b) Update the predetermined data in accordance with the determined ability and the 
20 subjea data. 

The base station of tiie fifth broad form of the invention may therefore be adapted to 
perform the method of the fburth broad foim of the invention, 

25 In a sixth btoad form the present invention provides a computer prognim product for 
determimng the status of a subject, the computer progtam product including computer 
executable code which when executed on a suitable processing system causes the 
processing system to perform the method of the fourth broad form of the invention. 

30 In a seventh broad form the present invention provides an end station adapted to determine 
the status of a subject, the end station including a processor adapted to: 
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a) Ueterimne sum paiametet being 

data iDcluding one or more parameter values, at least one of tftc pax 
indicative of the cunent biological status of the subject; the 
L^er the subject natter to a base station via a cotnmunrcauons network, the 



5 base station being adapteo to: .^a„m«5,1<« the 

piedeternnned data including: 

(1) One or n«>re parameter values for the respective individual; and, 

(2) An indication of the status of each individual; and, 

ii) St^ne the status of the subject in accordance with the results of the 

Kerr:i::S.on Of the status Of .he subje. ^ 
comnrnnicatioiis network. 

« T»e a- suae i. .«.ic^y •» "opc«.c -he « 
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invention. 



25 



30 



C^Jlg a.». *c »ycc. par^n^cr values.^ 

« ^ su^jea da. » P«-e«n.t«d a«a, a» P»a«e»i«d d«a 

including for each of a number of individuals: 

i) Oneormoieparametervaluesfortherespectiveindividualjand, 
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Thc processing system is generally adapted to perform the method of the ninth broad form 
of the invention. 

In an eleventh broad ibnxi the present invention provides a computer program product for 
5 determining the ability of a subject to perfonn in a sporting and/or racing event, the 
computer program product including computer executable code which when executed on a 
suitable processing system causes the processing system lo perform the method of the 
ninth broad form of the invention. 

10 In a twelfth broad form the present invention provides a method of providing secure arrays 
for use, each array including a number of predetermined features, the method including: 

a) Determining a number of respective feature layouts, each layout representing the 
positioning of each feature on a respective array; 

b) Determining a number of serial numbers, each serial number coiresponding to a 
15 respective layout; 

c) Generating a number of arrays^ each array being generated in accordance with a 
respective layout, and including the corresponding serial number thereon, the serial 
number being used in processing the array. 

20 The method can be performed to provide the arrays on behalf of an entity, the method 
including providing an indication of the layouts and corresponding serial numbers to the 
entity, to thereby allow the entity to process the arrays. 

The method of determining the layouts typically includes: 
25 a) Determinii^ a preferred layout; and, 

b) Moving the position of one or more of the features from the position in the 
preferred layout to alternative position. 

The method can include: 
30 a) Determinimg the type ofcach feature; and, 

b) Exchanging the position of one or more features having different feature types. 
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Brief Descripdon of Che Drawings 

An example of the present invention will now be described with reference lo the 
accompanying dmwings> in whidi: - 

5 Figure 1 is a schematic diagram of an example of a processing system for implementing 
the invention; 

Figure 2 is a flow chart outlining the process implemented by ihe system of Figure 1; 
Figure 3 is a schematic diagram of an example of a distributed architecture for 
implementing the invention; 
10 Figure 4 is a schematic diagram of an example of one of the end stations of Figure 3; 

Figures 5A and SB are a flow chart of the process implemented by the system of Figure 3; 
Figure 6 is a flow chart of the process for generating templates; 

Figures 7A and 7B aie a flow chart of the process of comparing the subject data to the 
templates; 

15 Figure 8 is a schematic diagram of a second example of a distributed architecture for 
implementing the invention; 

Figure 9 is a flow chart of the process for generating secure arrays; and. 
Figure 10 is a flow chart of the process for generating subject data vsing the secure arrays. 
Figure 11 is a flow chart of the process of data mining; 
20 Figure 12 is a flow diagram iUustrating dataflow stqps in a spcciflc example as part of a 
computer system capable of delivery of remote diagnostic services; 
Figure 13 is a flow diagram showing steps for diagnosing a condition of an am'mal in 
accordance with a specific example; 

Figure 14 is a diagram illustrating an environment for working the specific example shown 
25 in Figure 13; 

Figure 15 is a flow diagram iUustrating steps for preparing an array in accordance with a 
specific example of the invention; 

Figure 16 ^ a flow diagram showing steps for determining a nucleic add expression level 
in a biological sample; and 
30 Figure 17 is a flow diagram illustrating steps for building a database in accordance with a 
specific example. 



I 
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Detafled Description of the Frefen^ Embodiments 

An example of the present invention will now be described with reference to Figure 1, 
which shows a processing system suitable for iraplementing the present invention. 

In particular, Hgure 1 shows a processing system 10 including a processor 20, a memoiy 
21, an optional input/omput (I/O) device 22 and an interiace 23 coupled together via a bos 
24. In use, the interface 23 is adapted to couple the processing system 10 to one or more 
databases shown generally at 11. 



10 



In use, the processing system 10 is adapted to receive subject data, which is data 
representative of the current biological status of a subjecL The subject data is typicaUy in 
the fonti of raw data and therefore reqirires interpretation to aUow the status of the subject 
to be determined. This is achieved by having the processing system 10 compare the 
15 subject data to predetermined data stored in the database 11. The picdetermined data 
includes data representative of the biological status of a number of individuals, together 
with an indication of the actual status of the individuals when the predetennined data was 
collected. 

20 Accordingly, by comparison of the sutaect data with the predetermined data, this aUows 
the subject data to be interpreted and the current biological status of the subject to be 
determined. 

The manner in which this may be achieved will now be described in outline with respect to 
25 Hgure 2. 

In particular, at step 100 the user determines subject data In the fbrm of parameter values 
representing the cunent bioiogica] status of the subject. In particular, the parameter values 
represent specific measurements of selected parameters that represent the current 
30 biological status of the subject. It wiD be appreciated that a number of different forms of 
parameters may be used, as will be described in more detaD below. 
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15 



This p^ce-tare ^ ^ ^ 
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It therefore be appreciated that determination of the predetennined data is typicaHy a 
ti«e™„ga„dexpensive procedure. I. is not the^fo^ 

to an^le^ent the ntethod to collect their own predetennined data. Accordingly. Z 
5 ™ .s typically implemented nsing a distt«,„ted processing system an ca^p e of 
which IS shown in Figure 3. oi 
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3U withm a company or the like. 
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WhJ.. Ms «ch™<,„c descrtbes .ransfem-g a,e datt eIe«ro„icIIy vi. to comm»A«k»„ 
i. wiU ,!«, be possible 10 a>n^ d.,a vi. alttmali« K<*m,p«s «Kh « 
•™«fem«g datt fa . had, or pri„,cd fonna^ as wen as ««fcm„8 fte data decwmcally 
« a physical medico, such as a floppy disk, CD-ROM or the lite. Wi«Ie« trwsfcr or the 
5 '*«»»'»I»«ibIe,aswiabeappnK»atedbythcp«aa,rtiBedinthean. 

In any event, i. wiU be apptcciated that i„ Utfs ex^nple, .he a«vi«, provided by tf» base 
stafonl are genenmy accessible Via U„teen». 2. Tbep^c^sCt^syst^nlOisthe.^ 

end statio.^ Acco*,g,y. tt. processing sy«em 10 may be any suitable ibnn of 
processing system a^t ^„ .pp„,pd«c applica,i„« «,a^ i„ ^ ^1 
to allow the d«*ed f^n^ ,o be «*ieved. TypicaUy however the base station 1 
includes a precesang ^aem. such as a twwo* server, web server or the Mke. 

15 Similarly, «rf station. 3 .nu« be cpable of con.„™ica,i„g with the ba«. sOttio. 1 to 

aUow browsing Of web pages, or ate tr«sfer Of data in other nuinners. Accotdh^ly. as 
^wn.nFigure4.inthi,a.a„^^«^«^^^3„^^^^^^ 

a processor 30. a n«„.o.y 31. «, input/o.,^ (,/0) device 32 and an interfece 33 
coupW toge«»rvi..bns34. 33. whici, may be a n«w.,* in.«^ ca«l or 

20 *«'*c»-«ed.oco„pled.eendst.tiont.fl»Lne.n..2oroneofthe,e.pecUveLANs4. 

I. wm rtceibre u appreciated tha. u» e«l s«ti.n 3 may be fcr^d i»on. any salable 
pr«e^ sy«em such as a suitably prc^rammed PC, Terminal, Up*,p. hand 

« «•» wWch is typi.^ eper-ing appOcation soitwate to e«b^ ^ 
25 *^ or la- Alterttativcly, tb. «.d s^ion 3 may be fbrmed ^ spedali^ 
■-iw-^ such as an elect^mic touch s««iBvc sc^ coupled to suitable processor and 
J-noo-. b.^tothi^.h.«H.«atio.s3m.ybc comtected to the internet 2 or the 
l^s 4 V,. wned or wireless conn«=,io«, a. win be appr^iated by a pe«on skilled in to 



art. 

30 



Operatioa of the system to determine the status of the subject will aow be described in 
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more detail with reference lo Hguiss 5A and 5B. 

<'"'™'»-S'^P^«-va,„.sf6r.hesul,e«.TT»p.»«,«e,,.^„.„a^ " 

r2rr.::'t d«a « pred«e™uncd 
can Dejnteipreted by the base station 1. 



10 



15 



At step 220. the user accesses the base station 1 using the end station 3. 

At this stage, the user of the end station 3 will typicaHy be xeouited to rirf,. 
the base station 1 or supply a predeteonin^ ""^"^^ "^"^ 

oerfom,.^ ^ ., 7T P^d^t^^nnined user name and passworf. In particular, this is 

a^^Hot^ " se^icesprovided by the base JT. 

and/or to ensure that paymem can be obtained for the provision of the services. 

rrerr:re:.^r,r::rirr^i'— 

I«vi*»» for pay™«,, such a, .b. oJ^ 

•n.. ».r ™ne pa«.^ wfl. u«, 6. „^ ^ s.bse,„em.y verified in d« 



30 

In any event, when the user acc-e^ w 

the user accesses the base station 1, this is typicaUy achieved by 



14-n-02;22 :52 ;OAV!£S COLLISON CAVE Pat^&Trad 



;61 7 3368 2262 



# 2 4/110 



-20- 

accessing respective web pages generated hv th^ k 



5 °^*e"«'l>as(«cnaotlions6d. the user wffltem^ ^ 

«««<„, 1 and a,. «,d station ° _ . ^'"^ ^J'") «>»neaion between the 
ensute that .i. su^e^lr^^r*"" -Ht, a.K, p,„ieuU„ to 
10 retail coafld. -»se «a.i„„ 1 aad e«l «a.io, 3 is 

events «..s i„fcn«a*„ 1 ^1 """^ « -^o. racing 

After accessine the base station 1 at step 220 the sub.WH. • 
s.«onlatstep230. At this poin, .^^^^ 7^^^^^^ 

s«bjectc(atatoc.s^ehatitisgen.i„esubJcctCaL2r^^ T"" ""'^ 
<fisguisc an atten,pt to hack into the t^^. . 

received. °' ^ ^'""^ unwanted data i$ not 



20 



25 



In any event, at step 240 the processing 
subject data. 



system 10 operates to detcnnine the nature of the 
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At sKp 250 itejroc^dBgsyaem 10 selects mIcm™,,^.^. . 

10 will ,p««e ,„ «,e« pa:.^ J„ 'X"- 



i-*>™atio. ae '» °« <=™bodu«n^ fte p»«„,,^ 

«>iar RNA (snoRNA). small cytoplasmic RNA 



30 
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(scRNAX ribosomal RNA (rRNA), translational control RNA rtcl^NA^ ^ . « 

interference RNA riRNA\ (nucRNA) or 

^ t^iyj^ liKNA) and mitochonddal RNA rmtRMA^ c •* ui 

5 honnones, cytojcines. ah^mr^w «nagrmes, receptors, immunoglobulins, 

PlVSi=-««l psychologic,, ch^^aie,. ••■~^«™>'yl"s.o^»dh«,U,blc 

respecUvccoaditionaiatBJdrMdv*.-™-. v ^ ""V be relevant lo a 

P^ieuhr .Us c«^ ^ """""^^ ««*«™os«, coa«Uo„s. I,. 

ijr advantageous as the aiumals arc unable tn nrr..»-j- • * 
.0 co^^^ ^ s^cC t^^^^^ 

example, race horaes can su^r from u ^ ho^^s for 

'""^^ ' «»^ons, such as ovcrtndnfng. 
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J of subject «di.„.„- ,l "°^''^t»^^<ov^^ 

However, it wm be Hv»<*i«d «!« this fa rfso fbr a- 

10 Wit. .tMcL^lT^ T ™""»»«-"«^*«-« 

t. Erectly, but n»y tav, „ ...pact o. the atMe.c'5 pe*™^ 

to keep the cOTdWon secna. for Mamnl. ,k . 
15 «b«a..cesu»edby,he«Wc,l '^ '^""'^ 

-.-.fore b.ve ,0 be .^c^ ^JL^^^"^' 
«. individual fc,m . »^ oL,!° ■* «£ 

-«c«s.^ to s^lT^ZT'.T' ' ^ is , £. ,00. it wiM be 

25 individu.,^ .„ enL« jTo W^l to «,.ple « fc^ ,oOO 

Ftokennore, ibe more .v.iw,Ie lh», individuab a*ori„. ft„ 

•WW as Uds allows dishw™,..^.. . «*=™>« *>"HI>e condition the 

30 

The number of parameters required wiU dcnend ^ 

eo will depend on the number of conditions to be 



20 
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25 



disUnguished. In particular, it will depend on factor such as: 

• The presence and detection of unknown conditions; 

• Therangeofconditioiistobeidenlifiedv^-tlimthepopulation; 

• "I^elevelsofincideiioeofcachconditioninthepopulation; 

• The ability 10 distii^h between the conditions. 

It wiil be appreciated that as individuals, indudi.^ performance animals such as race 
horses, can suffer from a wide variety of conditions, then it is preferable for a large number 
Of p^cters such as 3,000 to 5,000 to be used. However, this number can be 
significantly lower if only a minor number of conditions ar^ to be identified. H.us for 
example, the number of panimet*«s used may be anywhere fi<.m 10 up to 10.000. or more 
Suitably, the number of parameters employed are at least about 20. preferably at least 
about 50, more preferably at least about 100. even more prefcmbly at least about 150, even 
«OTe preferably at least about 200. even more preferably at least about 300. even more 
15 preferably at least about 50^ even more preferably at least about 1000, even more 
Pi^erably at least about 1500. even more preferably at least about 2000, even more 
preferably at least about 4000, even more preferably at least about 6000, even mor. 
preferably at least about 8000. and stiD even more piefe^iy at least about 10000. 

In addition to this, the effect of a condition on an individual may al^ va^ in ac«>tdance 
with additional phenotypic information relating to a particular characteristic or set of 
^laacxistics of the subject, as detennined by interaction of the subject's genotype with 
the environment in which it exists. In this embodiment, such 'characterisuc data> may be 
selected ftom ^c, sex, height, length, weight, ethnicity, race, breed of ammal, feeding 
Pa«e«.. exerdse patterns, medication supplied, nutritional or growth supplements 
supplied, nutritional analysis, hair colour, sldn colour, eye colour, body composition, fet 
composition, wafer retention, obesity, transcnptomic profile, proteomic profile, blood type 
tissue type, endocrine function, immunological fenction including cellular and hmnorai 
«nmune fimction, tolcmnce. aUergy. t«nsplant refection, can«r, hypeT,lasia 
Sa^ointestmal ftmction, neurological function, kidney fimcuon, heart function, brain 
fiinction, pancreatic function, bone fenction, joint fimction, sexual or reproductive 



20 
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'=l»<«ter«ttc d«, ttgea„ wift tte exprcsdon data fcf the im^^ 

dete^s^ ft„ ,„ ^, 

P«d«crnuned d«. ^^^^ ^ p^^^ of U:, p«*«,„i„d date is 
ip-^ pe,*»„ed. „ d«. ™to.„g »d will be 

d«»ibed „ ™o„ deuul below «^ vriB now be o«Uned wi.h r^oce «> Hg„,» 6 



IS 



In^o-a. « «ep 300 fte opc^o^ of .he b« i ope«. » eo«e« 



^*»«^««*''«»«su«.eda«ftep.™^™,^,^.,,^,^ 
rnZT"^ «> «<p™«.«. dau in p.^, , ^ 

or »«« of >c^v. «*«^„ p^ obBi^d fto™ an a^y or U.e iite 

i*™«io„ wi,. .ypu:.ny be p„Mded a s«K,y of ^ ^especVe 

25 conccfly interpreted by the base station 1 . 

Each p,™...., v«,or i. ,on.ed f™„ . v.«o. c„„.i„i„g each ot fte 

30 J«n ,0 tbeo operles to co-^de, ,he po««o„ „, ^ p.^ „ 
«i»cc. wiic.. N oonc^iKi, to ««= numb., o,p.,,„^ „ 3 
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At step 330, tlie pioecssmg system ,o ooeru^ ,„ rf„, • 



15 



individuals iuiving similar physical condiiio,^^ 
Thus, the processing system 10 will operate at sten ^ . ^ 
«d .0 Cassia ,^ ^ »oJt !rr 

niulSpIicio. of ways of defi™. It ^Ibe w«i.ted .ha. th«B is a 

l«*«fonusfagU,e..n*l,te. ^ """^y P-ov^^ aatisticaUy efficfen. 

ft ««l be appiedMed .ka. tte <tefi«M<m of g^^s ™, be d»„. f 

^ "'*"^*«"P'"«^vo„fe,cl,i.««vid«,l*pten«j^fc 
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described above. piXHjessmg system 10 as 



15 



20 At stq) 430. the processins system 10 e^t™^ tu 

data fix,x„ the subiect ^7 ^"'""^ "^'^ phenotypic 

vectoH; st^^Z!" ^ta, and tbeo uses the panunete, values to ^nerate a pa^eL 

The piDoessing system obtains one or moie «f th» i ^ 
30 ^°"P**^^®™^^^"5e8havn,g a rcspiratoiy disease. 
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traits, ihen this template will be us^rf fr. 

««« » P'***^' membership usine the auhfc^ 

parameter vector at sten >» . •«•. me subject 

preset.. ^pir rx^r'"'^"™"'^^^^^^^ 

muhipUdty of ways of constL^ ^ « 

sldUed in the art tempfetes, a$ will be appreciated by 

persons 



templates mav corresn^r.^ . ^ Alternatively however the 

F may correspond to ranges such that each tRmrsUt^ 

respective 

the parameter vector is approximately equidistant ir. tr^ 
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I. Win ^ be ,pp^ u„ ^ u,.^of^f^r «,u« «iu =a,c„ «,e 



la „y ev««. « «ep 48ft u» proce«i«g sy«em ,0 ta.:p„„ ,be «p,^„„ ^ 
A number c™ be imple««nied u> U» p,.«»« teventioD. 

Multiple Firewall 

30 ^p«i«tar, i. «e .b<.c descdbed .x^,e « ^ be app^ „«„ ^.^ 



9 
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^« Of fl« end 3 a..«np. «, taO^ a» p„«si„g 3^ ,0 „^ „„„ ^ 

co™,„^-„ Witt. U« ««i»3 3. ^. 
25 w'"<*is4«enni.«dby-ttwi»o«ss.„ESyaeml2t„h... • ,. . °""««". 

<»™l»nson»Ml.lliei>rede.emii,.e<ldi.ta. <»«q»att 
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Once the comparison has been performed, the processing system 14 can detennine those 
conditions sujBfcred by the subject and then transfer an indication of this back to the 
processing system 12 through the firewall 15. The processing system 12 can then transfer 
5 an indication of this indication to the end station 3. 

It will be apprcaatcd that in this example even assuming the user is able to infiltrate the 
first firewall 13, the user will only be able to access previously submitted requests and the 
results determined therefrom. The presence of the dual firewall system therefore makes it 
10 virtually impossible for the user to infiltrate the processing system 14 and obtain access to 
the data stored in the database 1 1, 

In the remainder of the description^ it will be appreciated that the processing systems 10; 
12, 14 are effectively interchangeable. 

15 

Parameter Ranges 

A further alternative to the present invention is for the comparison to be performed on the 
basis of parameter ranges defiiwd for different conditions. 

20 Thus for example^ each condition may have associated therewith a sequence of parameter 
value raAges determined based on ranges of parameter values for individuals diagnosed 
with the respective condition. The parameter value ranges can then indicate for a 
respective condition the parameter values that can be expected^ allowing the determined 
parameter values to be compared to the respective ran^ for each condition to determine if 

25 the parameter values provided Ml within a respective range. 

Thus, a respective panimeter range can be determined for each condition, with the 
parameter values determined for a subject being conq^ared lo each range» to determine 
those ranges within which the subjea data falls. 

30 

An indication of the likelihood of the subject having a respective condition can then be 
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10 



detennined statistically based on the number of individuals having the respective 
condition. 

Multi-Level Analysis 

In the example described above, a number of conditions have been defined fbr the 
respective type of individual. However, it wiU be appreciated that sometimes it is 
desirable to perform tests to focus on specilic conditions. Thus for example, in the case in 
which a horse has an existing condition, it is sometimes desirable to monitor 4he 
development of the condition for the respective subject. 



In this case, as the condition has been determined, it wiU not usually be necessary to 
consider all of the parameter values each time the analysis is performed. In particular, as a 
large number of parameters are provided to aHow the different conditions to be 
distinguished, a large number of parameters will typicaUy not be representative of the 
15 progress of a specific condition. 

Thus, it is usuaUy possible to identify a number of key parameters that are relevant to 
respective conditions. Thus for example, conditions relating to respective respiratory 
illnesses may be uniquely identified using a smaller number such as 50 parameters. In this 
20 instance, if the user is only interested in examining for the progress of this respective 
condition, the user can simply supply an indication of the values for the respective 50 
parameters. 

In this example, the processing system 10 would operate to compare the determined 
25 parameter values against parameter values of horses suffering fi-om the condition and 
horses not suffering from the condition. In this situation, the manner in which the 
collection of the parameter values is performed may very. 

In the examples described above it has been mentioned that the parameter values may 
30 include for example expression data coDected using an array, for example. If the arrays are 
to collect values correqwnding to 5000 parameters it is typical for an array to be provided 
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10 



with 5000 features thereon with each feature correspoiKling to a respective parameter 
Alternatively, TO.OOO features may be provided with two features concsponding to each 
parameter, m any event, a person skflled in the art wiU appredate that a nwnber of 
Venations on this arc possible* 

However, if only 50 parameters arc to be meastu^ it is then possible to provide an array 
having 5,000 features with 100 features being used to determine the value ft>r each 
parameter. This aUows the parameter values to be determined for more accurately 
allowing a more accurate representation of the comiition to be detennined. In particular 
more accurate comparison of the subject data with the predetennined data can be' 
performed. 



Thus, a typical sequence of events may be for a user to submit a general test having a laige 
number of parameters similar to that described above which allows respective conditions 
15 to be first identified. Once a condition has been identified, the user «n then purchase 
speafically designed array plates adapted to monitor the specific condition. Measurements 
of the parameter values relevant .o the condition can then be made fer more accurately 
aUowmg the progress of the condition to be monitored in detaiL This can allow users to be 
^ P^^^'dedv^thinfoimationconcerningwhetheroonditionsareimprovingorn^^ 

Longitudinal Analysis 

In the methods described above, the subject data fi,r a respective subject is compared to 
predetermined data for a number of differem individuals. However, in addition to or 
^temadvely to this, longitudinal analysis can also be performed In tMs insuncc,'the 
25 subject data is compared to subject data pn^viousFy collected for the same subject. Thus, 
thw aUows the progression of a condition within a subject to be monitored. 

Again, it will be appreciated that if this is performed with a hmited number of parameters 
as described in the multi-level analysis described above, then this allows an accumte 
30 assessment of the progression of a condition to be made. 
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By storijig the results determined for a re^jective subject in the database 11, for a 
predetermined time period, this can allow the progression of the disease over a time period 
to be monitored and displayed to the user. Thus, the most recently obtained subject data is 
compared to earUer subject data for the same subject (and optionaUy predetermined ata), to 
5 determine disease progression. 

Thus, for example, levels of reqyective parameter values can be used to indicate the 
severity of the disease. This can be achieved by comparmg the subject data to 
predetermined data in the manner described above, or alternatively using other techniques. 
10 As the parameter values vary over time, this can be used to provide an indication of 
whether the condition is improving or worsem'ng. This is turn can be used to monitor tbe 
effectiveness of any treatment ^ven to the siib|ect. 

Thus for racample. if it is determined that a boise has been overtrained then the obvious 
1 5 solution to this problem is to reduce training for a predetermined time period, or resting the 
horse. However, trainers wiU generally not want to reduce the traim'ng too much as the 
horse will become unfit. Similarly, worse problems can arise if the trainer resumes 
training too early. 

29 Thus, in this case, the trainer can submit subject data on a periodic basis such as every 
weeic allowing the fitness of the horse to be determined on a weekly basis. An indication 
of this can then be transferred back to the user allowing the trainer to determine when 
training of the horse should resume, or how hard training should be. 

25 This therefore allows the severity of the condition within the subject to be monitored. 
Secure Arrays 

Arrays, or genechip technology determines whether genes are turned "on'* or "ofT under 
certain conditions. A single gene is a stretch of DNA found in the nucleus of a cell that 
iO encodes, through mRNA, the information required to produce a particular protein. 
Genechips measure the level of mRNA in cells and can be direcdy correlated to the 
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amoufiL of protein being produced in a celL 

In the example of collect subjea data for horses, a horse array and a blood sample are 
needed. The array has DNA dotted onto its surfece (DNA of the genes in horse blood 
5 cells). The DNA on the array consists of one strand of the double-stranded DNA molecule 
- the other strand is provided by the blood sample and is labelled with a dye. 

Two strands of similar DNA will only bind to each another (hybridise) if they match in 
sequence. An array reader can determine the amount of mRNA in a sample (gene turned 
10 -on"* or ^off) by determining the amount of dyclabelled DNA that hybridises to an array- 

The reader produces a value compared to a reference for every single gene on the array. 
The 5,000 to 10,000 values can then be compared to Genctraks' database. Genes turned 
"on'' or "oflT, individually or in patterns, can then be identified and correlated to the 
1 S specific conditions of a racehorse^ 

Various conditions in racehorses will alter the metabofism of the white blood cells, which 
can then be detected using the genechip technology. For example, the gene for manganese 
superoxide dismutase (MnSOD) may be turned ^'ofT in respiratory inflammation. 

20 Similarly, lENg, ILr4 may also be turned "off, and the genes for Grola, IL-8, TNF and 
MIF may be turned **on". This pattern of "gene expression** can be correlated to a specific 
condition, such as respiratory inflammation caused by a virus. Patterns of gene expression 
diange as a horse succumbs to or recovers from a viral in&ctioa As the technology and 
database develops, predictions on the stage of infection or influence of treatments can be 

25 made. 

As described above, it is preferably to ensure that the predetermined data is retained as 
confidential. 

30 However, if arrays arc used in the collection of data, it will be appreciated that it would be 
possible to purchase a quantity of arrays and perform data minir]^ of data obtained from 
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15 



array ^^Vou'^'** 



20 
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At step 650 the array supplier will operate to generate arrays in accordance with the 
randomised array layouts and the serial numbers. In particular, each generated array will 
have features positioned thereon in accordance with a respective one of the randomised 
S layouts, together with an indication of the corresponding serial number. 

It is typical for the array supplier to produce the arrays in batches with up to 1,000 arrays 
in each batch, with each batch being created in accordance with a difference randomised 
layout. 

10 

At Step 660 the randomised arrays are transferred to the users for subsequent use in 
generating the subject data, whilst at step 570 the serial numbers, together with 
conejponding layouts are transferred to the base station 1. 

15 The use of randomised arrays will slig^y complicate the production process but will 
vastly inorease the security of the arrays. In particular, third parties wiD be unable to 
utilise the arraysi, as the location of features alter, which will cause the third parties to 
obtain varying results on different arrays^ fbr the same sample. 

20 Operation of die system to use the randomised arrays will now be described with reference 
to Figure 10* In particular, at step 700 the user will obtain a biological sample from the 
subgect and then perform an assay process using the array at step 710, 

At step 720 the user uses the end station 3 to encode the values obtained from the array as 
25 subject data, together with a serial number indication. The subject data is then transferred 
to the base station 1, in the manner described above» at step 730. 

The processing ^stem 10 operates to determine the serial number from the subject data at 
step 740. The serial number is then used to access the respective array layout stored in the 
30 database 1 1 at step 750. 
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The array layout will then be used by the processing system to interpret the subject data, 
and in particular, to determine the respective feature to which each value corresponds. 
This allows the processing system 10 to hence determine the parameter values for the 
respective subject data. 

5 

Operation of the invention will then be substantially as previously described above. 

It will be ^predated that the serial number may also be used to check the user is an 
authorised user. In particular, if each user is provided with arrays having a respective 
10 serial number (a range of serial numbers), then having flic army supplier provide an 
indication of the user and the serial number(s) to the operator, this allows the operator to 
verify the identity of the user. This provides an audit trail for the arrays. 

Feedback 

15 A further way in which the present invention may be utilised is to provide feedback on the 
accuracy of provided results. 

In particular, if the base station 1 is used to provide an indication of one or suspected 
conditions in a subject, the user can be requested to provide an indication whcth^ the 
20 diagnosis provided by the base station 1 is correct. This may form a requirement, such that 
a user will only be provided with services by the base station if they agree to this term. 

In any event, the correctness of the assessment by the base station 1 can usually be 
determined by either treating the subject and determining if the treatment is successful, or 
25 by monitoring the development of the condition over a predetermined time period. Once it 
has been determined that the diagnosis is correct or incorrect, an indication of this can be 
transfcrrcd to the base station 1. 

At this point, the respective subject data collected for the respective subject can be saved 
30 as predetermined data in the database 11, with the oonnrmalion of the condition being used 
as the indication of the condition in the predetermined data. 
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In order to achieve this, the processing system 10 is typically coupled to a sample database 
that is used to store the subject data obtained from each subject. Once confirmation of the 
conditions is received the subject data and the condita^on indication is transferred to the 
S predetemiined data stored in the database 1 1, 



It will be appreciated that this checking of the conditions is not essential to the present 
invention as typically the data alone will be usefiiL However^ checking of the condition 
will be useful in determining the accuracy of the templates. 

10 

It wiU be appreciated that as further data is collected over thn>agh the feedback technique 
or through the use of alternative data collection methods the templates or other data can be 
updated allowing more accurate condition analysis to be performed. 

15 Users 

It will be appreciated that any individual may use the system. iiiUally at least however, it 
is necessary for the user to be able to generate the subject data. In the case in which arrays 
are used, for example, diis requires the user to first collect biological material, sudi as 
blood, and then analyse the material using the array. This is generally difGcult and 
20 requires skilled operatois using existing technology. Accotdmgly^ the user may have to be 
a skilled tedmician. However, it is envisaged that collection techniques wiU become 
simpler^ allowing the process to be implemented by any user. 

In the case of sportiqg or radng events, for example^ the users could include: 
25 • Athletes; 

• Trainers; 

• Drug testing committees (such as Olympic QflScials); 

• Medical practitioners (such as vets or doctors); 

• Event organisers; 

30 • Pathology labs (that would typically perform the work on behalf of an individual, 
such as a horse owner). 



m 
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However, this is not intended to be limiting. 
RcpOTts 

5 In genenO. the indication of any conditions suffered by the user, together with infonnatwn 
oottcenli^8 the ability of the subjea to compete in events, or the like, is provided in the 
form of a report 



10 



It will be appreciated by those skxHcd in the art, that the content of the report may need to 
be tailoied depending on the type of user. Thus, for example, a trainer will not be 
interested in knowing about parameter values for their horse, but will «ther want to know 
what conditions the horse has, and the severity. In contrast, if the user is a skiUcd medical 
piacfitioner, then there may be some benefit in haviflg more detailed information provided 
thereon. 



15 



Accordingly, the processing system 10 can be adapted to generate tailored reports in 
accordance with report tempLites stored in the database 11, or the memory 21. In this case, 
the processing system 10 will determine the type of user, and then access a respective' 
report lemptate. The template will specify the type of Information to be provided to the 
20 user, aDowing the processing system 10 to populate the report in accordance with the 
results of the above inscribed analysis. 

Thus, for example, in the case of the user being a trainer, the processing system 10 can 
access a user report template, which wiU include a number of fields. The processing 
25 system win determine ftom the field the information required, and populate the fields 
accordingly. This may require some additional processing to place the information in the 
required form. The information wiU also be directed to a level the user can understand, 
and WiU therefore typically avoid the use of technical terms (such as medical terms) for 
non-technical users. 

30 

Thus, die processing system may be adapted to determine the condition and severity. Tids 
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is then used to access a look-up table, which indicates how serious the condition is to the 
subject Thus, the LUT may indicate that the condition is serious and medical condition 
should be obtained. In this case, the report may therefore indicate merely that the subject 
has a condition and medical attention should be obtained, h wiU be realised that the advice 
5 may depend on phenotypic data. Thus, a young horse may be more or less likely to require 
medical treatment for a given condidon that an older horse. 

For skilled medical pracriiioners however, more detail may be required, in which case,.the 
processing system may be adapted to indicate not only the condition and severity, but also 
10 provide an indication of various important parameter values (such as red or white blood 
cell counts), to aUow the medical practitioner to determine what action to take. 

It WiU be appredated that the information displayed may depend not only on the user, but 
also the respective condition. Hirthermore, the information could be displayed graphicaUy 
15 or as numerical or textual information. 

As the completfon of the report template is automated, it will be appreciated that users may 
be allowed to submit their own report templates, in accordance with predetermined criteria, 
allowii^ the user to have reports generated in their desired format. 



20 



Finally, the processing system 10 can be adapted to provide other advice. This can indude 
for example, recommendatioos for changes in feeding habits, or the like. In general 
medical advice would not be given due to the issue of liability. However, it will be 
appreciated that the operator of the base station 1 could provide a medicaUy trained 
25 individual to provide medical advice if required. 

Architecture 

A range of different architectures may be implemented in addition to those described 
above. Whilst these wiU not be described in detail, it will be appreciated that any fonn of 
30 architecture suitable for implementing the invention may be used. However, one 
beneficial technique is the use of distributed architectures. In particular, a number of base 
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stations 1 may be provided at respeaive geogi^hical locations. This can increase the 
efficiency of the system by reducing data bandwidth costs and requirements, as weU as 
ensuring that if one base station becomes congested or a &u]t occurs, other base stations 1 
could take over. This also allows load sharing or the like, to ensure access to the system 
available at all times. 



IS 



10 



20 



In this case, it would be necessary to ensure that each database 11 contains the same 
information and templates such that the use of diffiarcnt ones of the base stations 1 would 
be transparent to the user. 



Subject Data 

The subject data may be selected from any expression product of the genome or 
diaracteristic or set of characteristics of the subject whose levels or abundance may vary 
within the subject or between two or more different subjects depending on their status. The 
15 data include, but are not restricted to, biological, physiological and pathological data of die 
subjccL Examples of biological data include, transcrlptomic profiles, proteomic profiles, 
cn^mc function, receptor function, and the like. Physiological data may be selected from 
age, sex, height, lengdi, weight, cthniciiy, race, breed of animal, feeding patterns, cxereise 
patterns, medication suppUed, nutritional or growth supplements supplied, hair, skin and 
eye colour, fat composition, obesity, blood type, tissue type, endocrine function, 
immunological function, gastrointestinal fimctfon. neurological flincdon. kidney fijnction, 
heart fiincb'on. biain function, pancreatic function, bone function, joint function, 
prosthesis, tissue rcoonstiuction, surgeiy, pain and the like Examples of pathological data 
include infecHous disease including viral infiectfon, bacterial infection, mycobaaerium 
25 infection, parasitic infection, prion function, cancer, transplant rejection, inflammatory 
diseases such arthritis and fibrosis, toxicological profiles, substance abuse including drug 
dqwndcnc^ and the like. 



Data Mining 
30 Data Mining 
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The system uses a self learning classification system, in which diagnosis is made using a 
historical database of test results (the predetennined daiaX which is updated as each test 
sample (subject data) is recorded. The historical database is typicatty maintained on a 

server* 



Chmcal application of the system can be used to diagnose a subject such as an animal with 
an unknown dinical or perfoimanoc state. Ttuit is. the animal may or may not have some 
disease, or may or may not be race-ready. A metaboHc pwfile is measured forihe-ammal 
subject 



I» a piefened «tample, the metabolic profile is comprised of expression signatures 
measured on an oligonucleotide chip. In a preferred example the metabolic profile is 
compared with a set of pre-compated diagnostic signatures (templates), a„d together these 
are used to predict the health status of the subject In a preferred example, prediction will 
15 include probabilistic estimates of uncertainty, and be accompanied by a list of possible 
difTerentia! diagnoses. 



Diagnostic signatures are computed by data mining a historical database, which contains 
metabohc profile data on subject animals (predetermined data), and associated clinical 
20 ^-fonnation on sut^ect health and performance status. 

metabolic profile measurement technique as is used in clinical appUcation. In a preferred 
example, these metabolic profiles are comprised of expression signatures measured on an 
oligonucleotide chip. 

25 Data mining may be performed using a number of techniques including: 

• Regularised discriminam analysis for high dimensional data, as described by 
Kuven (1992) Canonical variaie analysis of high dimensional spectral data. 
Tedknometrics 34 pp. 321-331. 

• Diagonal discriminant analysis as described by S. Dudoit, J. Fridlyand, and T. P. 
30 Speed (2002). Comparison of discrimimition methods for the ctassification of 
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tumors using gene expression date. Journal of the American Statistical Association, 
97(457Xpp.77— 87. 

• Support Vector Machines as described by M P. S. Brown. W Noble Gnindy, D Lin, 
N Cristianim, C Sngnct, T S. Furey, M Are?, Jr., D Haussler ^000) Knowledge^ 
5 based analysis of microanay gene expression data by using support vector 

machines. Proceedings of the National Academy of Science. 97(l):262-267. and Y. 
Lee, Y. lin, and G. Wahba (2002) MuMcategory Support Vector Machines, 
Theory, and Application to the Classification of Microanay Data and Satellite 
Radiance Data. Technical Report 1064. Department of Statistics, University of 
10 Wisconsin-Madison. 

• Bayesian kernel fitting algorithms. 

• 1^ based recursive partitiom-qg Bieiman. L., Friedman, J., Olshen, R., and Stone, 
a 0984) Classication, augmented by Bagging Breiman, L. (1996) Bagging 
predictions. Machine Learmug 26(2) pp. 123-140 and Boosting Breiman, L.(1998) 

15 Arcing classifiers. Aimals of Statistics 26(3) pp. 801-849 

It win be apparent to practitioners skilled in the art. that other data mining procedures may 
be used to rephice those identified above, without materially changing the nature of the 
imrention. 



20 



25 



Diagnostic signatures are combined with test subject metabolic profiles to produce a 
diagnosis. In one example, (where data mining was based on regularised or diagonal 
discriminam analysisX prediction is based on a Baycs classification rule, and estimates of 
uncertainty are based on posterior probabilities of dass membership. 



In another example, (where data mining is based on Support Vector machines), 
classification is based on the suppon vectors, and uncertainties are estimated from distance 
of the test profile to the decision boundary. In another example (where data mining is 
based on recursive partitioning) classification is based on the estimated decision tree, or 
30 averaged over multiple decision trees. 
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It wfll usually be the case that even for an ammal with an unknown clinical conditioii or 
performance status, some clinical or perfonnance conditions axe known. For example, it 
may not be known whether or not the animal has disease A. but it known that the animal 
has disease B and does not have di:iease C. When test samples a« recorded, the historical 
5 database is updated to include the test sample, and any known concomitant clinical or 
perfonnance infonnation. 

It will usually be the case that an animal is tested more than once during a period of 
investigation. Ke-testing may occur at a time when an earlier unknown dinical condition 
10 has become known. For the example given above, it may be the case that at a ti„.e of rc- 
testmg for race^eadiness it is known thai during the imtial test the animal did have disease 
A. Provision is made to aUow updates to and modification of the dimcal data obtained for 
each test subject, as diagnosis Is confirmed or modified. 

15 In one example, data mining is repeated at «,gular imeivals as the historic database grows 
Test records added to the historic database wiU frequently contain only partial clinical or 
performance data. For any given dimcal or peilbnnance fector. data will be filtered to 
remove subjects for whidi the particular diamcteristic is um^corded. The data mining 
algorithm will then be used to construct new diagnostic signatures for the given clinical or 
20 performance diaracteristic ITte pK>cedure of filtering and mining is «peated for each 
characteristic of interest. In this way. the sample sizes used to obtain diagnostic signatures 
are constantly i„acasi^8, and predictive performance improves. lUe system becomes self- 
leaming. 

25 ^eovemU process is iflustrated by Figure 11: Which shows the flow of information 
processing in the sdf-learning diagnostic system. 

It is apparent that die Historical database must be initialised, and preliminary data mimng 
conducted before dinical application of the diagnostic system. The database will be 
30 mitialised using a training set comprising data ftom animals with known metabolic 
conditK>ns. Appropriate experimental design is vital to the construction of the imtial 
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traiQing data set. Empirical predictors derived using data mining are susceptible to 
artefectual relatiooships, involving nuisance factors - such as regional difierences in diet 
and husbandry. For this reason, the training dau set must be obtained from a multicentre 
trial, and stratified appropriately. 

5 

Specific Exajnples 

Spedfic examples are set out in more detail in Appendix A- These are for the puipose of 
demonstration only and are not considered to be limitins. 

10 Persons skilled in the an will appreciate that numerous variations and modifications will 
become apparent. All such variations and modifications which become ^parent to 
persons ddUed in the art, should be considered to fsdl within the spirit and scope that the 
invention broadly appearing before described. 

IS Thus^ for example^ the above description has focussed on the testing of a general subject. 
It win be appreciated that this is most advantageously used for performance animals to 
identify conditions that may lead to a decrease in performance. This allows trainers to 
identijy problems with horses or other animals before they would be noticeable using 
existing techniques. This is of particular benefit in the horse racing industiy as it allows 

20 problems to be identified in advance^ which can in turn allow the conditions to be 
corrected before they effect the horses performance, which in turn can result in a vast loss 
of eanungs for the trainers and own^ of the horse. 

However, the technique may also be applied to any subjects* including humans. 

25 

It wilj be appreciated that different predetermined data will be required for each type of 
subject being assessed. 
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APPENDIXA 

Figure 12 is a flow diagram iUustratiog one spedMc example of an infonnation technology 
architecture and data flow as part of a remote delivery service process External users are 
5 shown as Class One 505, Qass Two 510, and Qass Three 515 ihat are interested in 
obtaining information regarding thdr respective gene eTqiression results when using the 
proprietary gene repression analysts service. Hiese users may include, for example, 
pathology laboratories, drug laboratories, pharmaceutical companies, collaborators, 
medical and/or veterinaiy pracdtioncKs or similar, owners of perfoimance animal^ athl^es 
10 and/or athletic trainers. Each of these users 505, 510, 515 will be interested in difiterent 
aspects of the gene expression results and will thereibre interact in a different feshion, but 
all will interact remotely via an user interlace module 520. 

Ihteiface 520 may, tor example, be a browser-based inter&ce as fbund on most computers 
IS and delivered via web pages on the world-wide-web (the Internet). TTie initial interaction 
to the user interface module 520 will be via a contrc^ed firewall and web server. ITie 
firewall will be the first Knc of defbnce against unwanted and unauthorised intrusion. Port 
bloddng techniques and protocol restrictions will be imposed at the firewalL The firewaU 
and web server environment will be fiilly maintained with the latest security patches to 
20 ensure currency of protectiim against hackers and intrusion. Bach user will egtah tigh a 
secure connection 525 (user authcndcation and establish secure web connection) to ensure 
confidential identification in both directions for the user and service delivray provider. 
The security is managed by a customer access management system 565 that controls access 
of users 505, 510, 515. Such security measures are oonmuMily used in the art and one 
25 embodiment would be use of SSL (secure socket layer) technology and digital signatures. 
Further security layers can be added at this interface if required and might include 
challengeAesponse component such as continuously changing numerical keys in 
possession of the user and available in plastic card format and trusted networks. 



Class One and Two Users 505, 510 are shown sending information as a query 530 and 531, 
that includes a question regarding health or condition status of an animal (inieipretation 
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lequest). sample details, gene expression results, clinical information, pathology laboxatoiy 
results, gpne identities, gene sequences, collaborative lequests, etc Qass Three Users 515 
are shown sending information 535 as a query including intenogation requests regarding a 
liealth status of individual animaWathletes or groups of individual animal^athletes. 

Queries 530 and 531 may contain formatted gene expression and dim'cal information as a 
request, one such embodiment would employ the use of digitally signed XML documents 
to ensure authenticity and content of the request Other authentication, authorization and 
encryption and key management sumdards wiU be applied as they become available. 



As a further security measure to protect central databases 590, fiom outside unauthorised 
access, queries are temporarily stored in a transaction staging module 540 and queries 532 
and 533 will be drawn into respective pathology service module 550 and collaborative 
services modules 555 only on request fiiom the service module. IWs process may employ 
15 a second firewall and may be conjured to further restrict network traffic. Tliis firewall 
WiU only permit internal requests from 550 555 5<>0 to pass through the firewall. All other 
network traffic will be blocked as wiD umiecessary ports and protocols. Respective 
pathology services module 550 and collaborative services module 555 include special 
software capable of servicing requirements of the different types of users 505, 510 
20 Pathology services module 550 and collaborative services module 555 are sholn in 
communication with each other. Core central databases 590 store genetic information 
(genetic database) 591. sample and gene expression information (sample database) 593 
and correlative data (correlative database & heuristics) 595. The genetic information 
stored in genetic database 591 is used to create gene expression devices Design details 
25 592are also stored in the sample database which contains gene location information on the 
device and are used to interpret results from such a device. 

The genetic database 591 is also used to provide gene identification and gene sequence 
information to collaborative services module 555 and collaborative services 575 (eg. 
30 inten,retations, gene lists and gene sequences) to Class Two users 510. Informatfon in the 
sample database 593 can be clustered together based on similarity using computer 



m 
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algorithms such as K-means, principal component analysis (PCA) and self-oiganising 
maps, commonly available in packages provided by companies such as spotfire, silicon 
genetics, and at higher levels of interpretation. Qmniviz. These dusters amount to 
identified correlations 594 between gene expression and sample information and are stored 
5 in various formats, in the coTTdative database 595. An heuristic or neural network or rule- 
based computer software system pre-programmed with rules or training sets takes queries 
534 (eg. expression details and sample details), stores these details in the sample database 
593 and then compares die query pattern to those already stored in the correlative database - 
595 and produces standardized reports and correlation details 570 (according to the rules of 
10 the heuristic program). Correlatioi, details are converted to useful information such as 
• gene expression correlation results, for example a fiiUy formatted report to include 
interpretations 571 and interpretations 575 (and optionally genes lists and gene sequences) 
and are securely delrveted back to the requestor via the internet to Class One and Two 
users 505, 510. 



15 



Financials database 597 keeps trade of details includit« for example accounting, 
purchasing and payroll detaib. Sales and marketiAs database 596 keeps track of items 
sudi as sales and marketmg details, client details, customer relations management and 
stock management. Internal data warehouse 560 receives infimnation from databases 590, 
20 596 and 597. This internal data warehouse 560 will only be accessed by authorized 
internal users conducting legitimate business activities. A secure (internal) data warehouse 
545 services the needs of Qass Three users 515. Spedfic (and confidcnrial) infonnation 
580 is extracted from intemal data warehouse 560 that is then stored in secure customer 
data warehouse 545 where authorized users 515 can query 535 (for example as 
25 interrogation requests), specific and confidential informarion sudi as clinical history 
information, pathology results and interpretations. Hiis information is presented in a 
secure user-ftiendly and/or visual format 585 in relation to individuals or groups of athletes 
or performance animals, and/or time series of results. 

30 Figure 13 is a flow diagram of one specific example showing steps for assessing a 
biological sample for diagnosing or assessing a condition of an animaL A user collects a 
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biological sample 1010, for example a blood sample from a hocse. At the san.e time 
biological parameters including biochemical and haematdogical parameters, clinical data' 
(includmg blood profile tests) and appraisal info^ation a« collected and recorded in a 
standard format 1015, for example by filling in a standard form. lUe biological sample 
1010 IS processed so that nucleic acids contained therein are detectable when hybridised 
with a complementary (or mismatch-complementary) nucleic add located on an array 
1020. The nucieic add may be detectable by a label incorpomted theidn, for example a 
target nuddc add. Preferably, the array 1020 is a device such as a nucroaxray which is 
read 1030 by standard methods and equipment common to the art to identify and measure 
relative abundance or absolute abundance of those m.deic adds from the biological 
sample which have bound to probe nudeic adds immobilised as part of an^y 1020 
(indusion of a reference sample nin in parallel allows for the calculation of the relative 
abundance of ,a,get middc ad*, whereas a method developed by the company 
Affymetrix. Inc (the "Afiymctrix system-^ as descxfted at their website "a^n.etrix.com" 
15 relies on internal references). 



10 



Array 1020 may comprise a large number of probe nuddc adds, eg. 1000>s of nudeic 
acids. A large mimber of probe iwdeic adds may be particularly useful if an animal is not 
I«s^tingwithanyvisiblesignsofpoorconditio«,eg.overtdisease. Accordingly, i„ one 
20 embodiment, labelled target nucleic adds of a sample arc first appUed to an anay 
compnsing a -full-sc«en- of target nudeic adds (eg. i,000's of nudeic add probes that 
r^i^^nt most or many of the nudeic adds expressed irr a sample). Based on results from 
the full-screening, the labdled nuddc add targe,, may be applied to a sub-set of the full- 
scre^. eg. a sdected panel of nuddc add targets that may be assodated with a particular 
25 condition, for example, respiratory diseases, drug consumption, etc. 

Data from the read microartay 1030 and dinical data and appraisal information 1015 is 
fomiatted 1040 and transmitted via a commum'cations network 1050, for example the 
fatemet, to a n^mo.e diagnosUc server 1060. It wiU be apprcdated that transmission of the 
30 formatted data to the remote diagnostic server 1060 tequixcs less bandwidth than 
transmitting database information to the user and less skill and time on behalf of the user 
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The transmittal d«. is „^ ^ con^arison to a daaba* of 

P-yioody conceal i^tion in rotation to dinic ,„fo™.flo„ ^ session levels 
(«l.t.ve*,™i.«e)of,te^.fc^^^p^„^__^^^^^ Also. experts fer 

5 <!«. «. p»vide addition! .scfcl i«fom«id,„. ti« «„w« co,.ela>.on to a 

«»d.tK>n 80. In this manner, tb, .xp»ssi«n te«b (r^rtive or absolute abundance) of the 
nndeic acid pmbe. appHed to U,e nrfc^aray 1020 a« correlated with previously collected 
data tetating to known conditions s«,ied in a d«ab.« 1080 a„d compfled 1090 The 
da,*a« „ay ^ au„e intormation in rehtion to «. idenHty of taov™ nucleic acids. 

on ae .r„y an^or location of „«a.ie acids on the array, its 
biological function and links to other databases. 

Results in naation to health and p«fornuu.ce condition are .rananri„«i via a 
con,n«„ic«ions network 1050 and nu.y also be p:„vid«l to the user as a teport 1095. for 
15 «™»Ple a hardcopy printout or visuaUy on a computer nuinitor. 

IIK desc*«l system has «lva«agcs of re<,uinug low bandwidth for flitting sample 
<l«.j|nd final .opor. be««en use, ^ „mo.e da.abase*,o«ssor. data ptocessteg is 
«o.r.l»ed and more effident. expert analysis of U,e san^e data is centralised, the 
computer s^ ™y i^^ ^ ^ 

^on. the possiWiie, of user and inte,prc,«ion bias is avoided, and inlbnnation s.o»d 
m the comm«ciaUy valuable database is under stnc control and docs n« n:,ui«> direct 
•ecessbyano^sideuser. The steps ate described in moni detail hereinafter 



20 



25 



30 



ngure 14 shows an envitomnen. for working the method deaoibed in Figure 13. A user 
1100. Which may be a veterinarian or practitfoner. «.IIec,s , aampi, 1,20 liom an ammal 
noi. tor example a blood sample ftom a horse or athWe. Concu„e«Iy. iniinrmadon to 
re^«»n to a condition of the animal is collected in . «aodard t»n^ 1102. Tlte sample is 
collected, nucleic acids isol^ed thereftom, prep^d «d applied an «,ay 1120 and Uie 
«ray „^d by an array leader 1130. I>«a 6o«, U» anny ,e«ler 1130 ..K. clinical 
.ppraisa. and condition infcnnau™, H02 is «.te«l imo . computer ««l fconatted by a 
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processor 1140, which may be for example, a laptop computer with a modem. The 
formatted data is transmitted via a communications network 1150. for example the 
Internet. A remote diagnostic server 1160 receives the transmitted data and the data is 
compared with a database(s) 1161 which stores data, for example, data in relation to 
5 nucleic add location on an array, expression level (relative abundance or absolute 
abundance) of a nucleic add hybridised with a conesponding nudeic add on an array, and 
data correlatiKS nudeic add expression levd and performance, health, or condition of an 
animal. 

10 Figure 15 is a flow diagnun illustrating steps for preparing an array. A biological sample 
1210 is ooUected from an am'maL Biologica] sample 1210 may comprise for example, a 
blood sample (preferably white blood cells isolated thereftom), mine sample, or tissue 
sample (induding fetal tissues and tissues in various stages of devdopment). A spedfic 
aim of coUecting the biological sample is to isolate and sequence as many relevant genes 

15 from the sample for use on an array. Thousands of nucldc adds may be isolated that may 
form a large number of probes for a broad screening of an animal's genetic make-up or 
gene expression pattern. 



Nudcic adds arc isolated fiom the bicd^gical sample. In one instance the sample may be 
20 used to prepare genomic DNA or tissue spedfic mRNA 1223. Tn another instance RNA is 
isolated fiom the biok>gical sample 1210 and a cDNA library 1220 is prepared from the 
isolated RNA. Plasmids 1221 comprising d>NA inserts from library 1220 may be 
sequenced 1222 from dther or both 5» a«Vor 3' end of the nudeic add. Preferably. 
scquendDg is from the 3' end. Sequences may comprise Expressed Sequence Tags (EST). 
25 If an isolated nuddc add does not encode a fuU-length gene (eg. an EST), a partial nudeic 
acid may be used as a probe to isolate a flilMength nudeic add. Altemauvely, or in 
addition, BST sequence information may be compared directly with a sequence database 
1230, for example OenBank, and a search for rdated or identical sequences performed 
Putative gene identification and function 1231 may be determined from a search, for 
30 example a BIAST search performed in st«^ 1230. By determining the number of u'mes 
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each gene is represented in the Hbraor, a computer may be programmed to enable the 
normabsalion and standardisation of the lelabVe abundance data of mRNAs in a sample 

Gene-qn^^c oUgonucIeotidcs 1232 may be synthesised using information from EST or 
5 foll-nucleotide sequence 1222 dau. Gene-specific oligonucleotides 1232 may be used as 
amphficab'on primers to amplify (step 1224) a region of a coirc^onding nucleic acid The 
nudeu: aad used as template to amplify a .egion of conesponding nudeic acid may be, for 
example, isolated plasmid DNA 1221 andA>r genomic DNA, cDNA or mRNA (eg used 
with RT-PCR) 1223. THe nudeic add thus prepared can be used directly as the nudeic 
10 aads for attaddng to an array 1240. Amplification products 1225 may also be generated 
usiQg non-gene-spedfic primers (eg. oligo^, plasmid sequence flanking a nucleic add of 
interest). OKgonudeotides corresponding to a gene 1232 may also be used on array 1240 
altemativdy .he oligoaudeotide corresponding to known sequence can be buili 
successively nudeotide by nudeotide on a support using A^-etrix methodology such as 
15 that in US patent no. 5,831,070, incoqwrated herein by reference. 

In one embodiment, the step relating to construchug cDNA 1220 and isolating plasmids 
1221 comprising the cDNA may be omitted. In this embodiment, isolated genomic DNA 
or tissue spedfic mRNA 1223 is used as a template to make amplificaiion product 1225 by 

20 amplification using gene^-fic primers 1232. Amplification product 1225 may be 
attached to army 1240. 

Nuddc «lds .««*cd to or buu. o„,„ ,240 ,„ferably .^^n. ^ 
prefcnbly .U, .:,p,^ed in a give, Os^ ^ „, ^ ^ 

25 for . comple.. dfagnoafc to, ™«ho,« Mood, .he amy shoold coMain genes 
«p,««d to the eelb of blood unde, various conditt.« „ various s^ge. of ccir 



R^re 16 shows a flow diagram comprising steps for determining gene expression in 

b.ol08icaIsamplescomprisingbothreferencetargetl305arKisanipletargetl310. Nuddc 
adds, in particular RNA (total RNA or mRNAX are isols^tcd from biological samples 1305 



wm. 



L. 
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and 1310, which „,ay be the same sample. cDNA is prepared from ihe RNA and the 
cDNA IS labelled resulting in labeUed tai^ets 1320 and 1325. Alternatively, or in addition, 
CDNA may be »«ed as a template to synthesize labelled antisensc RNA for use as targets 
1320 and 1325. Reference target 1325 may be provided as a pn^viously prepared labelled 
target of known concentiation. Accordingly, reference target 1325 need not be synthesised 
m pamllel with each sample target. Internal comrols for reference target 1325 and sample 
target 1 320 provide a means for normalising and scaling relative probe concentrations. 

Sample target 1320 and reference target 1325 are hybridised with array 1330 in step 1340 
Array 1330 may. for example, have been prepared by steps shown in Figure 15 The 
hybndised array is washed 1345 to remove non-specific hybridisation of targets 1320 and 
1325. It will be appreciated thai one skilled in the art could select ditferent stringency 
conditions of wash 1345 as required. Anay 1330 is read in an array reader 1350 to 
determine reUiive abundance of RNA in the original sample, which correlates with 
15 expression of the correspondi,g gene in the biological sample. 

Figure 17 is a flow diagram iBusttating steps for building a database. Biological samples 
1410 arc coUected from animals having specific known condition(s). Preferably a 
statistically relevant number of biological samples 1410 are collected from a variety of 
normal ammab to establish a normal reference range of nucleic acid abundance levels 
THIS should account for natu«l variation, including that associated with state of fitness 
sex, age, season, breed and diumal changes. Nucleic adds are isolated and labelled 1415 
from sample 1410. thereby forming respective target nucleic acids. T^c labelled ta^et 
nucleic aods 1415 are applied to anay 1420, which may be prepared as described in 
Figure 15. lie anray is read 1430 and data fonnatted 1440 into an electronic form for 
example a digital signal, suitable for transmission via a communications network 1450 
Chnical information from clinical appraisal, in relation to conditions of animals of intetesi 
IS measured, documented and compiled 1460. n,e dimcal information is preferably 
collected in a standard format, and for example, variable states soch as the level of fitness 
30 body score (fatness) may be assigned given a value or number (for example between 1- 
10). Specific clinical conditions may be graded (for example between 1-10) and assigned a 



20 
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unique and standard identifier. An example of such a system is currently used in clinical 
medicine and veterinaiy science and termed SNOMED or SNOVET (Standardised 
Nomenclature of Medicine or Veterinaiy Science), where a duical condition can be 
described using a numerical system. This system Has not been used for describing the 
5 normal condition or the ability of a performance animal to perform to its best A numerical 
gradiag system could also be used to standardise the collection of such data, for example, 
time spent on a tieadmill is a stroAg indicator of exercise tolerance, as is blood 
concentration of oxygen and ability to transport oxygen. Conditions may include disease, 
response to drugs, tnrining. nutrition and environment Hie clinical information 1460 i$ 
10 formatted into electronic form 1440. for example a digital signal, suitable for transmission 
via a communications network 1450. 



The process is repeated such that a coUection of several anay readouts for particular 
conditions are made. A standard range (for example, a population median of 95%) of 
15 values for each of the represented genes and its relative abundance can be calculated. This 
reference rai^ can then be used as a comparison to test sample results. 

Nucleic add eiqwssion information ftom a read array 1430 for a target sample is 
coirelatcd with previously measured conditions 1460 to provide information on nucleic 
20 acid expression level (abundance or relative abundance) with any previously measured 
condition. This information is compiled at server 1470 and good data is stored and bad 
data rejected 1480. The compilation process includes collection of a large enough set of 
anray readout informabon for a particular condition so that inferences can be drawn on 
gene expression profiles and conditions. The compilation 1470 may also include use of 

25 sophisticated pattern recognition and organisational softwan. and algorithms (examples 
common to the ait include algorithms such as K means, Anova and Mamr Whitney, Self 
Organising Maps, principal component analysis, hierarchical clustering - any one of which 
i« avaflable as part of proprietary software packages) such that expression patterns that 
diffisr to normal or expected condition can be identified. The compilaUon 1470 will 

30 preferably include sophisticated methods of supervised classification such as reguhiriscd 
discriminant analysis, diagonal discrimimmt analyjds, support vector machines^ or 
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counted biologic^ ^ uio „ ^ ^^^^ ^ ^ 

b«wc« any cMca, ota«v«K„ «K. .™y AU. to this , ^.^^ ^ 

con^naog d.« o. noddc add wMch ™y ^a,, 

«»<lm», for «a„^„ nonna, and ^ ab«»a». co„di„oo(s), ™u „„„ei, add 

^«n. ■n>e«-«.-«14«.n»ybeac««=d„,i^^.fl,^^^ 



Tlno^gkou. Uu. apecificadon. «„,ca, die con«x, r.<,^,es od»™ise. d,o words conipriac 
con-pmes con„,i.™g ^ „ ^ ^^^^ J 

Of m«g« b« ^, a„ „j „^ 

I. order aa. fl,e techmqaes ouUined ^ ^ ^y 
15 P-««»''««^I»«ic-U,prcl^e„,^„.^^^„^.^.^ 

'bUowmgiKiii-Iiinidne examples. y oi me 



SlEPl 

Collection 

20 A Mologiea. s«^e co^pri^ „„ele,-c adds, for e«n.p,e RNA and n,RNA i. 

con«.c<L Tte s«.p.. may i-Klude «„s of d» sys.em a. va*». «ag.s 

^dd whoto Hood „U.c«d a vd„ of a pe^r^ee anta.,. However, U,e 

25 '>''^*™-n.«iUvagcs,ioi„,fl„Hpcri,o.«allIuidor«.o,.dcl,„id*Hoh in 
^ comprises ceus of immune sys^ Have inm^ s«C a>s.^ „ 
2^ P--. blood whid. comprise „,RNA may ind«fc mau^ tom„^, 
-^op.„, ™»„op,m^ ,ympbocy«>s. n»oocy«s. rcUo^ ^u,^ 

30 ZTn ^' 'PP^ -issue. ofnoM-ood oH^ vl„^ 

3U times m various condirions. 
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Meihods de^ed I»™ may indufc „« <rf to ,b«vem««ion«l cdl types, m 
b.ol<^ i, ^tected .u* pr.p«d u«,g vario« »«h«ls. Fo, example, an .a.y 

m«hod Of «,n«.i,« «Il„f ft. Mood b by v«apu«,„ce. The biologicl sample may be 
collecKd fiom a p«fo™.o„ ammal. fcr example. . ho«. wiU. suspected laminitis. a 
"■"i-w athlete or camel wWi 0«e«dK»«l,O5is. oc . greyto^ 

Blood sample 

T«, ml of blood is dntwn dowly (to p„vem hemolysis) ftom the veto of an ammal 
Ou^ v«. ta a ho,«> atri «nel, veins m. the ft„«i™aimb of homaos and dogs) into a 
1:16 volume of 4% sodium ci,„tc t, p,ev». dolHug and the sample is mixed am. then 
Pto«dm.ice. Tbe«miae is ceut^bged a. 3000 RPM a. 40C for 15 minutes and White 
Wood cea. (WBQ (commonly c.n«l fl» -b„«fy eoaT) are removed ftom the imerfcc. 
between plasma mKi red blood cdls(MC)in.oasepam,e na,e using a pipette. TTte WBQ 
«c ^ treated wifl, .t leas. 20 votames of 0.8% ammom„m cUonde solution to lyse any 
«»«g RBC m-d re^fuged « 3000 RPM a. 40C Ibr 5 mimttes. Tie peUe J 

V^«<»«.w.sbedi„a9*sodinmch,oridc.,e^mnfi.ge^andl.ep.oni«. l^eceU 
peUct IS then used directly in RNA extraction. 

Non-blood hlnl/»fj ical fluid samp l. 

»«.ton..l flmd or ax«dc flui^ at 3000 WM a. 4 OC for 20 mim«es to 

•oUeot cells. &„np|es cmnptising large amounts of mucous ^ nc^ with . ^^^^ 
•s™. su* a. ditwothtctol ptior to cemrtiugatio.. A ceU peHet is then washed in 0.9^ 
^ "*™'*l«'i*.H><«.Hf„8ed and the ceU pellet is used direcdy in RNA ext,^^ 

Tissue biopsy 

A ^ssue biopsy is frozen in dry or liquid niUo^cn and cxushed to powder using a mortar 
and pestle. Hie frozen tissue is then used directly in RNA extraction. 



15 
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Total RNA aad/oi mRNA is isolated from a biological sample. Use of isolated mRNA 
rather than total RNA may provide results with less background and improved signal. 

RNA is commonly isolated by skiUed peisons in the art, and examples of some methods 
5 ibr isolating mRNA are described below. 

Commercially avaflable kits, for example, Qiagen RNA and Direct RNA extraction kits, 
and RNA extraction kits produced by Ihvitrogen (formerly life Technologies) and 
Amcrsham Pharmacia Biotech herein incorporated by reference, may be used by following 

10 the mami&cturer's instructions. Key elements of these mRNA extraction protocols 
inchjde use of an appropriate amount of sample, protection of the sample fiom RNAse 
contamination, elution of the sample from a column at 700C and quantitation and quality 
checking in an agarose 0.7% gel and using an OD 260/280 ratio. About 0.2 gm (wet 
weight) of pelleted white blood cells or tissue is required for each mRNA extraction whidi 

15 will yield about l-2^g of mRNA. Disposable gloves should be worn throughout the 
procedure, with frequent changes. Both the column and solution used for clution should be 
at700a 

RNA quantification and assessment of RNA size and quality include standard gel 
20 electrophoresis methods of running a smaU quantity of an RNA sample on an agarose gel 
with known standaids, staining the gel with for example ethidium bromide to detect the 
sample and standards and comparing relative intensities and size of standard RNA and 
sample RNAs. comparison of the intensities of the ribosomal RNA bands. Alternatively, 
or in addition, RNA concentration in a solution may be determined by measuring 
25 absorbance at 260/280 nm in a spectrophotometer relative to known standards and 
calculated usii^ known formulas. 



cDNA Synthesis and Labelling; 

RNA prepared as described above may be synthesised to cDNA and labeUcd resulting in a 
30 labelled probe using kits provided by suppliers such as Amersham Pharmacia Biotech, 
Invitrogen, Stratagenc or NEN, herein incorporated by reference. For example, a typical 



m 



wmamm 
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reacUon may comprise: template RNA, an oligo-dT primer and/or gcnc-spedfic primers, 
reverse transcriptase enzyme, deoxyribonudeic triphosphates (dNTP), a suitable buffer, 
and a label incoiporated into at least one of the dNTPs. Such a reaction when combined 
with a method of amplifying the resultant cDNA is referred to as RT-PCR (reverse 
5 transcriptase-polymerase chain reaction). A specific example is provided below, but it 
should be noted that other methods of incoiporation of label into DNA can be used and 
that such methods are under constant review and improvement, fbr example some methods 
include the incoiporation of amino-aUyl dUTP- and sid)sequent coupling of N- 
hydroxysucdnate activated dye to increase the specific labelliug of the DNA. 

10 To anneal primei(s) to tempktfe RNA. mix 2jig of mRKA or 50-100 Mg total RNA from 
respective test sample (Cy3) and reference sample (Cy5> in s^iate tubes with 4pg of a 
regular or anchored oligo-dT primer or s«ic-q;>ecific primers in a total volume of 15 
(usiug puiified water to make up the volume). (Regular oUgo dT is 5'-'nT TTT TTT TTT 
TTT TTT TIT, anchored oligo dT is S'-TTT TTT TTT TTT TIT TTT TTV N-3% (where 

15 V=A, C or G; and N=A, C, G or T). Heat mixture to 65oC for 10 min and cool on ice. 
Add 15.0 fil of reaction mixture to reactive Cy3 and Cy5 reactions. 

The reaction mixture comprises of the following: 6.0 ul of 5X first-stiand buffer, 3.0 ^1 of 
0.1M DTT, 0.6 ul of unlabeled dNTPs, 3.0 «1 of Cy3 or Cy5 dUTP (1 mM, Ameisham), 

20 2.0 ul of Superscript n (Reveise transcriptase 200 U/fiL, Life Technologies) made to 15 
with pure water. UnlabcUed dNTPs are sourccd from a stock solution consisting of 25mM 
dATP, 25 mM dCTP, 25 mM dGTP, 10 mM dTTP. 5X first-strand buffer consists of 250 
mM Tris-HCL (pH 8.3). 375mM KCl, 15mM Mga2). The mixture is incubated at 42oC 
for 1 br. Add an additional 1 pi of reverse transcriptase to each sample. Incubate for an 

25 additional 0.5-1 hrs. Degrade the RNA and stop the reaction by addiqg 15^1 of O.IN 
NaOH, 2mM EDTA and incubate at 65-70oC for 10 min. If starting with total RNA. 
degrade the RNA for 30 min instead of 10 min. Neutralize the reaction by adding 15^1 of 
0.1N HQ. Add 380|il of TE (lOmM Tris, ImM EDTA) to a Microoon YMOO column 
^illiporeX 

30 
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Next add 60pl of Cy5 probe and 60m] of Cy3 probe to the same microcon. Centrifuge the 
column for 7-8 min. at 14,000 x g. Remove flow-through and add 450 pJ TO and 
centrifuge for 7-8 min. at 14,000 x g (washing step). Remove flow-through and add 450 {xl 
IX TE, 20 Jig of spedes-specific Ctotl DNA (ZOugAil, Life Technologies for human - CWl 
5 DNA is genomic DNA that has been denatured and re-annealed such that the concentration 
of the DNA and the time of le-anneaUng multiplied equals 1. Methods for maldng Cotl 
DNA are common in the artX 20ng polyA RNA (10 ng/ul, Sigma, #P9403) and 20 ng 
tRNA (10 iig/ul, life Tedmologies, #15401^11). Centrifuge 7-10 min. at 14,000 x g. 

The probe needs to be concentrated such that with the addition of other solutions required 
10 for hybridisation the volume is not excessive, or is suitable for use with a desired slide and 

cover slip size. Invert the microcon into a clean tube and centrifuge briefly at 14,000 RPM 

to recover tiie probe. 

A nucleic acid may be hibelled with one or more labelling moieties for detection of 
15 hybridised labelled nucleic acid (ie. probe) and target nucleic acid complexes. Labelling 
moieties may include compositions that can be detected by spectroscopic, photochemical, 
biochemical, inmiunocheniical, optical or chemical means. Labelling moieties may 
include radioisotopes, such as 32P. 33P or 35S. chcmfluminescent compounds, labelled 
binding proteins, heavy metal atoms, spectroscopic markers, such as fluorescent markers 
20 and dyes, magnetic labels, Lnked enzymes, and the like, Prefeixed fluorescent markers 
include Cy3 and Cy5, for example available from Amersham Pharmacia Biotech (as 
decribed above). 

cRNA synthesis and labelling 
25 The Affymetrix system uses RNA as substrate and generates biotin labelled cRNA through 
a series of reactions detailed in a protocol available fiom their website (aJ^metrix.com), 
incotporated herein by reference. The cRNA is fl»gmented prior to application onto the 
array. 



30 
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Arrays 
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One feature is an array comprising nucleic acids representing expressed genes from cells 
found in blood of a perfonnance animal, for example a horse, human, camel or dog. The 
nucleic acids may be of any length, for example a polynucleotide or oligonucleotide as 
defined herein. 

5 

Each nucleic acid occupies a known location on an array. A nucleic acid target sample 
probe is hybridised with the airay of nudcic adds and an amount or relative abundance of 
target nuddc add hybridised to each probe in theanay is determined. 

10 High-density arrays are usefiil for monitoriAg gene expression and presence of alleKc 
markers wMdi may be associated with disease. Fabrication and use of high density arrays 
in monitoring gene expression have been previously described, for example in WO 
97A0365. WO SKWQSSS and US Patent No, 5,677,195, all incorporated herein by 
reference. In some embodiments, high-density oligonucleotide arrays are synthcsised 
15 using methods sudi as the Very Large Scale Immobilised Polymer Synthesis (VLSIPS) 
described in US F^m No. 5,445,934, incorporated herein by reference. 

Arrays for human are commercially available fiom companies sodi as Incytc, Research 
Genetics, and Affymctrix. Uon Biosdence recently announced forthcoming release of a 
20 dog mictoarray and have a done coUection of dog cDNAs. These aitays typicaUy 
comprise between 2.000 and 10,000 genes and are spedes spedfic None are available for 
the horse or camel. Some of these genes are in multiple copies on the anay and have not 
been fully annotated or given a tnie gene identity. AdditionaUy, it is not known whether 
DNA on the array, when hybridised to a test sample, spedfically binds to a single gene. 
25 Tliis latter instance results fiom splice variants of RNA transcripts in tissues such that one 
gene may encode maltq>le transcripts. 



30 



Human and dog arrays (when available) can be used in methods described herein. 
However, these anays are cuirenily non-spedCc and include genes that are not expressed 
in blood cdls of animals, and/or do not contain genes important in controlling the fanciion 
of btood cdls, and/or comain regions of genes that are not specific to blood cdls. 
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Ciones containing specific genes are available and can be purchased for human (mouse and 
dog) for use on arrays (for example from the IMAGE consortium or Lion Bioscience). 
However, it is not possible to obtain specific clones for use on a blood^pccilic array 
5 without prior knowledge of what genes are expressed in blood ceUs. The IMAGE 
consortium also does not guarantee that the gene of interest is contained in the done 
purchased. 

Array Construction 

10 Because of difScufties, problems and a likelihood of wasting financial resources to obtain 
a bIood-^)ecific DNA array, a method is piovided herein which provides rapid and cost 
effcctive generation of spedes and tissue-specific DNA arrays for assessing nucleic add 
expression in a sample. Figure 14 shows st^ for constmcring an array in one 
embodiment. 

15 

Target Nuddc Add Preparation 

Biological samples are coUected as described above. Samples comprising cdls expressing 
as many genes of interest in relation to c(»idition(s) of a peiformance animal are collected. 
For example, a sample comprising a mixture of nucleated blood cells from performance 
20 animals with conditions such as, osteochondrosis, laminitis, tendon soreness, bursitis, 
abcesses, inflammation, allergy, viral infection, parasite iitfection, asthma, etc. 

Approximately 5 Mg of mRNA is isolated firom the biological sample (typically 1 gm wet 
weight) using mRNA isolation kits or the protocol described above. Concurrently, 5 of 

25 mRNA is isolated from umbincal cord blood, and/or early stage foetus. Cells and tissues 
contained within these sources would express genes that may not be expressed in the cells 
extracted from blood in the above example. Isolation of cytoplasmic mRNA jfrom cells is 
preferred. Hiis step involves rupturing the cells with a solution comprising detergent 
and/or chaotropic agent and salt such that cell nudd and the nudear membrane remain 

30 intact. The cell nudd are pelleted by centrifugation and the supernatant is used for mRNA 
extraction. Protocols for this procedure are available as part of mRNA isolation kits (eg 
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availablc by Qiagen). These mRNAs may be used to conslnict cDNA libraries. Kits for 
the construction of cDNA libraries are available from companies including Stratagenc and 
Tnvitrogen (eg Uni-ZAP XR cDNA synthesis library constroction kit #200450). The 
library preferably should be constructed such that the orientation of the cDNA in the vector 
5 is known, that the mRNA i« piimed using oligo dT, the vector is capable of receiving a 
nucleic acid insert up to 10 kb and that purification of DNA suitable for DNA sequencing 
is possible and easy. By following the manufacturer's instructions and paying particular 
attention to the quality of mRNA used and the size fractionation of cDNA (greater than 0,7 
kb), a quality library containing enough viruses (>lxl06) with insert sizes >0,7 ld> can be 
10 generated. 

Plasmids generated from such a library can be DNA sequenced using protocols that are 
well established in the art and are available, for example, firom Applied Biosystems. 
Briefly, a mix of 0.5 jig of plasnud DNA, 3.2 pmol of a primer that hybridises to the vector 

15 DNA (eg Ml 3 -21, or M13 reverse primer), thennostable DNA polymerase, dNTP and 
labelled dNTP is subjected to a routine PGR procedure to generate fragments of DNA that 
can be separated by gel electrophoresis and using madiinery such as that av^ble from 
Applied Biosystems (eg a 3700 DNA sequencer). Generated DNA sequence data 
(chromatogram) is assessed and quati^ scores and binnii^ of similar sequences is done 

20 usirig a computer program package such as Phred/Phrap/Consed. The raw DNA sequence 
data can then be loaded into a database where comments (aimotation) on the sequence can 
be made, such as quality score, bin, length of poly A sequence (should there be one), 
BLAST search results, highest homology in Gcnbank, done identity, other entries in 
Genbank. 

25 

Subjective fectors influencing whether a nucleic add should be used on an array include 
quality and confidence of the DNA sequence, a Genbank homology score with identified 
nucleic acids, evidence of a poly-A tail (indicative of a translated transcript), uniqueness of 
the 3' sequence data (compared to both Genbank and an in-house database of clone 
30 sequences). 



m 
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Nucleic acid primers can be selected using a program such as Primer 3 available via the 
Internet (ww-genome.wi.imt.cdu/cgi-bin/piimer/priinei3). The selected primers may be 
used for ampli^ing a nudeic add, for example by PGR, or directly applied to an array. 
Uniqueness of a nudeic add can be tested by performing additional BLAST searches on 
5 Oenbank and an in-house database. Primers are preferably designed sudi that melting 
temperatures are similar, and amplification products are of a similar nudeic add length. 
Primers for PGR are generaUy between 18 and 25 nudeotide bases long. Piimeis for direct 
use on a microanay or device are preferably between 50 and 80 nudeotide bases long. 
Both the amplification product and the single primer should hybridise to DNA that 
10 uniqudy identifies a gene transcript. SpcdiEc programs using various formulas are 
available for calculating the meltiqg temperature of various lengths of DNA (eg Primer 3). 
Alternatively, selected DNA sequences can be provided to Af^mctrix for production of a 
proprietary and custom anay. The sequences generated in-house are provided to 
Afiymetxix in Pasta format along with details of whidi parts of the sequence to be used for 
15 the generation of a probe set (11 probes, each 25 nudeoUdc bases long) for each gene 
represented on the array. . 



Nudeotide sequences may be compared with an existing database, for example Genbanl^ 
to determine a previously provided name, tissue e3q>ression, timing of ^piession, 

20 biochemical pathway, duster membership, and possible function or cellular role of an 
expressed nudeic add. In addition, a nucleic add fragment may be used as a probe to 
isolate a fiill-length nuddc add which may encode a gene which is assodated with a 
particular disease or condition. Further, identified nucleic adds may be used to isolate 
homologues thereof indusivc of orthologoes fiom oflier spedes. An identified nucleic 

25 add may also be cloned into a suitable expression vector to produce an expressed 
polypeptide in vitro, which may be used, for example as an antigen in generating 
antibodies and for use on protein arrays. The antibodies may be used for developing 
specific diagnostic assays or therapies, for three dimensional protein structure such as X- 
lay czystallographic studies, or for therapeutic developmeuL 
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Ai) airay may comprise any number of different nudcic acids, but typically comprises 
greater than about 100, preferably greater than about 1,000, more preferably greater than 
about 5,000 different nucleic acids. An array may comprise more than 1,000,000 different 
nucleic acids. Each nudeic acid is preferably represented more than once for scanning 
5 internal comparison and control. Preferably, the nucleic adds arc provided in small 
quantities and aic gene-spedCc and/or spedes-spedfic usually between 50 and 600 
nudcotides long, arranged on a solid si^poit 

The Affymctrix system uses 1 1 probes per gene, each of 25 nudeotidca, that are built onto 
10 the array using a photolithographic method (US Patent Nos. 6,309,831; 6,168,948; 
5,856,174; 5,599,695; 5,831,070; 6,153,743; 6,239.273; 6,271,957; 6,329.143; 6,310^189 
and 6346,41 3> The nudeic adds may be dotted onto the soUd support or bound to 
nriccospheros, or in solution. A typical array may have a suijgice area of less than 1 cm2, 
for ocample a microanay. 

15 

A nuddc add can be attached to a solid support via chemical bonding. F^jrthermore, the 
nucleic add does not have to be directly bound to the solid support, but rather can be 
bound to the solid support through a linker group. The linker groups may be of sufQdent 
length to provide eotposure to the attached nudeic add linker groups may include 
20 ethylene glycol oligomers, diamines, diadds and dte like. Reactive groups on the solid 
support surface may react with one of the terminal portions of the linker to bind the linker 
to the solid support. Another terminal portion of the linker is then fiinctionabscd for 
binding the nudeic add. A solid support may be any suitable rigid or semi-rigid support, 
induding charged nylon or nitroceUulose, cheraicaUy treated glass sUdes available from 
25 companies such as MEN. Coming, S&S, arrays avaflable through Affymetrix, membranes, 
fllteis, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, 
polymers, micropanidcs and capillaries, llie soUd support can have a variety of surface 
forms, such as wells, trenches, pins, channels and poies, to which the nuddc adds are 
bound. Preferably, the solid support is optically transparent. 

30 
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The anay may be constructed using an "arraying madiine" manufactured by compames for 
example Molecular Dynamics, Genetic Microsystems, Hitachi, Biorobotics, Ameisham, 
Coming. Alternatively, the array may be manufactured according to spoaUc instructions 
provided by the user to A^rmetrix. Source materials for this machine include nricrotitce 
5 plates compridng nucleic aa'ds represratative of unique g<aies, or sequence information. 
An array element may comprise, for example, pJbsmid DNA comprising nucleic adds 
specific for a gene sequence, an amplified product using gene-specific or non-specific 
primers and template DNA or RNA, or a synthesised q>ecific oligonucleotide or 
polynucleotide. Array elements may be purified, for example, using S^hactyI-400 
10 (Amersham Pharmacia Biotech, Fiscataway, NJ.X Qtagen PCR cleanup columns, or high 
performance liquid chromotography (for oligonucleotides). 

Purified anay elements may be apfdied to a coated glass substrate using a procedure 
described in U.S. Pat. No. 5,807,522, incoipomted herein by reference. By other example, 
15 DNA for use on Coming amino-sitane coated slides (CMT-GAPSTM) is re-su^nded in 
3xSSC to a concentration of 0.15-0.5 Mg/|Jl and then used direcdy in an arraying machine 
in 96 or 384-well plates. 

An exanqple for preparing an array element is provided by the manganese superoxide 
20 dismutase gene. A done comprising a midcic acid insert is prepared and isolated as 
described above. The done is sequenced to identify the nudeotide sequence. A BLAST 
search using the identified nudeotide sequence is performed to determine homology of the 
doned nucldc add widi nuddc adds in a database, for example GenBank. Identification 
of nucleotide sequence homology with superoxide dismutase genes stored in the database 
25 provides a level of confidence that the clone comprises at least in part a gene for 
superoxide dismutase for the horse. Unique primers can be designed to amplify a nudcic 
acid using PCR and the done DNA, or genomic DNA from the same species as a template. 
Purified amplification product can be directly attached to an array and thereby act as a 
target for a complementary labelled nucleic add probe in the test and reference samples. 
JO Alternatively, a unique sequence can be determined and an oliognudeotide manufectured 
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and purified for direct use on an array, or the sequence information supplied direcdy to 
Affymetrix for the construction of a custom array. 



The array may comprise negative and positive control samples (preferably as duplicates or 
5 triplicates) such as nucleic aads from species dififerent from a sample being tested 
(negative controls) and various nucleic acids (representative of RNAs and both ends of 
RNA molecules) that are found in aU tissues as a constant and known quantity Orositive 
controls). These controls are identified and used by the array reader to provide data on true 
signal (ic. Specific hybridisation between probe and target) and noise (ie. Non-specific 
10 hybridisation between probe and target) and average intensity from multiple reads of 
several different locations for each nudeic add attached to the array. 

A test sample and a reference sample may be simultaneously assayed on the array. The 
reference sample may comprise mRNA from multiple sources, such that most, preferably 
15 aU of the nucleic acids on the array are r^resentcd in the test sample, and can be used by 
the array reader as a non-zero standard and for comparison with an average of the read- 
outs from the test sample. A relative intensity for each gene on the array can be calculated. 

The relative abundance of expression of each gene in a sample can also be calcukued using 
20 controls within the array, such as certain genes expressed in a tissue at a constant level 
under all conditions. 



Alternatively, using the AfFjrmetrix system, an absolute level of expression is calculated 
based on the difference between the perfect match and mismatch hybridisation for each of 
25 the 11 probes for each gene. Using such a process a gene is scored as present or absent 
and an absolute measure of intensity is given along with a p value. 



30 
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The interpreted array may highlight only a few genes that are substantially differem 
expression between a test and reference sample. Altemauvely, the overall pattern of 
expression may provide a "fingerprint" to characterise the way in which the original cells 
have responded to a particular condition of a performance animal. For example, the gene 
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for superoxide dismutase may be the only g(aie up-regulated in a particular condition, 
especially ia conditions of inflamniatioD, or a large number of genes may be up- and 
down- regulated in various conditions. It is this fingeiprint, raflier than specific knowledge 
of gene sequence or function that can be used as a marker for various conditions. It would 
5 be expected that fii^geiprints be useful across species barriers to include performance 
animals such as humans, horse, dog and camel. 



The^ anangcmmt- of nudeic acids on the array may be poiodically dianged -and these 
arrays are tbsn assigned a particular bateh code that conesponds to a specific array 

10 cofflprisiiig a spedfic nuciek acid arrangement. The ability to change the arrangement of 
nucleic acids on the artay and knowledge of the exact arrangement may prevent other 
people flx>m generating a database using the arrays described above. Using a batdh code 
also enables tradung of manufacturers of the arrays in le^rds to the number of arrays 
produced. The batch code further enables validation of a user of die communication 

15 network or "intcnncf diagnostic method and system. Batch code can also identify a 
particular type of array used, should more disease-specific arrays be designed and 
manufactured. 

An example of how an array may be prepared and anafysed is described in Eisen and 
20 Brown (Methods in Enzymology, 1999. 303 179) and In US Patent No. 6,114414, hetetri 
incorporated by reference. Chapter 22 of Ausubel et aL supta also describes methods and 
apparatus for use with arrays and is herein incoiporated by leference. 

Control samples may be respectively labeUed in parallel with a test and reference sample. 

IS Quantitation controls within a sample may be used to assure that ampUficaiion and 
labcfling procedures do not change a true distribution of nucleic add probes in a sample. 
For this purpose, a sample may indude or be '^iked" with a known amount of a control 
nudeic add which spedfically hybridises with a control target nudeic add. After 
hybridisation and processing, a hybridisation signal obtained should reflect accurately 

0 amounts of control nucleic acid added to the sample. For such purposes, a microanay may 
have internal controls, for example a nudeic add encoding a common gene repressed by 



m 



14-n-02;22 : 52 ;DAVIES COLLISON CAVE Pat,8cTr8d ; 61 7 3368 2262 # 73/110 



69- 



the perfomiaiice animal with known expression levels and a nudeic add encoding a gene 
ftom another species thai is known not to hybridise to the test or reference sample. To 
improve sensitivity and spedficity of the assay, bloddqg agents such as Cbt DNA from the 
tested species may also be used. 



STEP 4 



Hybridisinpi <Satn pie Nudeic Add Probes with an Array 

Nudeic add probes may be prepared as described above from a biological sample fiom a 
10 peiformance animal that has been assessed concurrently by physical inflection and/or 
blood tests or other method. Nuddc add targets iirom a statistically relevant number of 
normal animals previously hybridised to arrays, and a reference range for each of the genes 
on the array is calculated and used as a normal reference range (for example a 95% 
population median). Results from a test sample from a test animal can be compared with 
15 the same genes as die normal re£esrence to determine if the test sample fells within the 
normal reference rai^. FUither. nuddc add targets may also be prepared ftom biological 
samples from apparently normal animals, animals with overt disease, various progressive 
stages of disease, hitherto undiagnosed or undasdfled conditions or stages of such 
conditions, animals treated with known amounts of drugs (l^aL or otherwiseX animals 
20 suspected of being treated with drugs (legal or otherwiseX animals under specific cxerdse 
regimes for die sake of performance, animals subjected lo (intentional or not) various 
nutritional states and/or ravironmental conditions. Databases of information from the use 
of sudi samples and armys are created such that* test samples can be compared. The 
database will then contain ^edfic patterns of gene expression fbr particular conditions. 

25 

Prior to hybridisation, a nucleic add piobc may be fragmented. Fragmentation may 
improve hybridisation by minimising secondary structure and/or cross-hybridisation with 
another nucleic add probe in a sample or a nudeic add comprising non-complementary 
sequence. Fragmentation can be performed by mechanical or chemical means common in 
30 the art. 
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A labelled nucleic add target may hybridise with a complementary nucleic add probe 
located on an array. Incubation conditions may be adjusted, for example incubation time, 
temperature and ionic strength of buffer, so that hybridisation occurs with pxedse 
coiuptementaiy matches (high stringency conditions) or with various degrees of less 
5 complementarity (low or medium stringency conditions). High stringency conditions may 
be used to reduce background or non-specific binding. Specific hybridisation solutions 
and hybridisation apparatus are available commerdally by, for example, Stratagene, 
-Clontech, Geneworks. 



Affymetiix have detailed a standard procedure for the hybridisation of probes with an anay 
(as describe at their website, affymetiix.com, incorporated herein by reference), however, a 
typical method emails the following: 



Adjust probe volume (prepared as above) to a value indicated in the "Probe & TE" column 
15 bdow accordiqg to the size of the cover slq> to be used and then add the appropriate 
volume of 20XSSC and 10% SDS. 



Cover Slip Size 
(pom) 


Total Hyb 
Vohime(l) 


I Probe & TE 
(1) 


20xSSC(l) 


10% SDS ( I) 


22x22 


15 


12 


2.55 


0,45 


22x40 


25 


20 


4.25 


0.75 


22x60 


35 


28 


5.95 


1.05 



20xSSC is 3.0 M Nad, 300 mM NaCitrate (pH 7.0). 

20 

Denature the probe by heating it for 2 min at lOOoC, and centrifuge at 14,000 RPM for 15- 
20 min. Place the entire probe volume on the array under the appropriately sized glass 
cover slip. Hybridize at 65oC (temperatures may vary when using different hybridisation 
solutions) for 14 to 18 hours in a custom slide chamber (for example a Coining CMT 
25 hybridisation chamber #2551). 
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Washing the Array 

After hybridisation, the array is washed to reniove non-specific probe and dye 
hybridisatioa Wash solutions generally comprise salt and detergent in water and are 
commercially available. The wash solutions are ^plied to the array at a predetermined 
5 temperatxire and can be performed in a commercially available apparatus. Stringency 
conditions of the wash solution may vary, for example from low to high stringency as 
herein described Washing at higher stringency may reduce background or non-specific 
hybridisation. It is- understood that standardisation of this step is required to produce 
maximum signal to noise ratio by varying the concentration of salt used, whether detergent 
10 is present (SDS^ the temperature of the wash solution and the time spent in the wash 
solution* 

A typical wash protocol consists of removing the slide fifom a slide chamber^ removing the 
cover dip and placing the sKde into 0.1%SSC (recipe provided above) and 0.1% SDS ai 
15 room temperature for 5 minutes. Transfer the slide to 0.1% SSC for 5 minutes and repeat. 
Dry the slide using centrifiigation or a stream of air. Equipment is available to enable the 
handlirig of more than one slide at a time (for example^ slide racks). 

STEPS 

20 Reading the Array 

After removal of non-hybridised probe, a scanner or "array reader^ is used to determine the 
levels and patterns of fluorescence from hybridised probes. The scanned images are 
examined to determine degree of hybridisation and the relative abundance of eadi nucleic 
acid on the array. A test sample signal corresponds with relative abundance of an RNA 

25 transcript, or gene expression, in a biological sample. Alternatively, an Af fy metrix array is 
read and conqputer algorithms calculate the difference between hybridisation on perfect 
match and mismatch probes for each of the 11 probes sets for each geno. It ihen calculates 
a presence or absence, an absolute value for each gene and a p value for the absolute call. 

30 Array readers are available commercially from companies such as Axon and Molecular 
Dynamics and Affyraetrix, These machines typically use lasers, and may use lasers at 
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different frequencies to scan the array and to differentiate, for example, between a test 
sample (labelled with one dye) and the control or reference sample (labelled with a 
different dye). For example, an array reader may generate spectral lines at 532 nm for 
excitation of Cy3, and 635 nm for excitation of Cy5- 

5 

A relative quantity of RNA may be calculated by the array reader and computer for 
respective nucleic acids on the array for respective samples based on an amount of dye 
detected, average of duplicate samples for respective g^es and subtraction of background 
noise using controls. The reader is pre-programmed to perform such calculations (using 

10 proprietary sofhvare supplied with the array reader, such as MAS 5.0 for the Affymetrix 
system and Gcnepix for the Axon Instrum^ts reader) and with information on the location 
of each nucleic add on the array such tibat each nucleic add is given a readout value. 
Controls or reference samples providing a readout for particular nucleic adds that falls 
within standard ranges ensures correct integrity of the array and hybridisation procedures. 

1 5 Programs typically generate digital data and format it for transmission 

STEP6 

Querying and Transfer of Digital Data to a Central Database 

20 Generated data is transmitted via a communications network to a remote central database. 
A user haviiig access to the gene expression data enters information in relation to a tost 
sample into a standard diagnostic form such that it can be digitalised. The information will 
indude clinical appraisal and blood proiile results. The format of such information is 
standard globally such that details on dinical conditions may be based on numerical input 

25 and each field of entry can be digitalised For example, body temperature field could be 
number 0001, a recorded temperature within normal range would receive the number 0, 
0.5OC above what is considered to be the normal rai^e for that spedes would receive a 
nutnbcr 5, IOC above normal range would receive 10. Some examples of conditions that 
may be scored or rated in such a fashion are provided below. 

30 

a) Body temperature. 
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b) Integument: eyes, sores, abcesscs, wounds, insects/paraates, allcfgy, infection. 

c) Cardio/Respiratory: eyes, nasal discharge, rales, viial/bacterial infection, allergy, 
chronic obstructive pulmonary disease, cough/wbeeze, crepitous sounds in the thorax, 
epistaxis, auscultation sounds, heart sounds, capillary refill, mucous membrane colour. 

d) Gastrointestinal: diarrhoea^ colic/stasis, parasites, appetite level, drenching time and 
dose. 

e) Reproductive: stage of pregnancy, abortion^ inflammation, discharges, 

f) Musculoskeletal: lameness laminitis, bone or shin soreness^ muscle soreness or tying 
up, tendon or ligament affected, level of pain. X-ray data, sdntigtaphy data, CAT scan 
data, bursitis, bruising, cramping or '^ing up^. 

g) Blood test results: biochemistry, immunology, serology (vizal, baaeriological. 
hormone levels), cell counts, cell morphology, pathologist interpretation. 

h) Other diagnostic test results: X-ray, biopsy, histopathology, CAT scan, MRI, 
bacteriology, virology. 

i) Other data: Season (date)^ location, male or female, vaccination history, body score 
(fitness and fat% fitness levcL 

Alternatively, the entire system could be based on the aforementioned SNOMED system 
with appropriate modifications to encompass descriptions of «:ercise physiology and the 

20 normal animal. Alternatively, the entire system could rely on text or categorical data that 
can be appraised and scored by software such as Omniviz. Whatever system is used, if 
would be appreciated that the aim is to adequately, systematically and in a standard manner 
describe the current condition of the animal to the best of curr»tly available technologies 
and cotild include results from machinery such as X-ray, ultrasotmd, scintigraphy and 

25 blood analysis* 

The user also ensures that array results (that may for example be automatically collected 
from a reader), array ^edfications, data mining specifications, level of interpretation 
required and the clinical information are entered and correspond to the same animal and 
30 the same sample. The form is transmitted electronically to a central database and 
recognised as an individual accession or request by the database. The central database 



14-11-02;22:52 JDAVIES COLLISON CAVE Pat.&Trad ;61 7 3358 2262 # 78/110 




-74- 

recogniscs the user (using for example digital certificates), the user recognises the central 
database, the array batch code and gene array order are verified, and the user is allowed 
access (which may be automatic) and automatic processing of the request is performed if 
security and billing information are adequate. The processing involves specific mining of 
5 central data and specific user requested information is retrieved and resent automatically. 

The above steps may be automated so that a user need not be present to perform the tasks. 
In an automated spedfic^xamplc, gene expression data from an array reader may be 
transmitted via a communications network directly to a server which is connected to a 
10 central database. Additional information could be iiq>ut by the user at a processor which is 
also Unked to the array reader. 

Automated Data Mining Using Sent Data (Heuristic Methods) 

A central database interprets the array specificaiions (eg- nudeic acid order on a 
microarrayX decodes the information transmitted, determines nucleic add expression level 
in a biological sample and compares the expression level and patterns of expression with 
known standards or reference range. Various leveJs of database interpretation may be 
applied to the data transmitted, d^ndlng on the user requirements. Clusters of genes may 
be tqp-regulated or down-regulated in certain conditions and the database makes automated 
correlations to q>ecific conditions by accessing various levels of database information. 

Mining software such as Metaminc (Silicon Genetics^ ArraySCOUT (Uon Biosdence) 
can be used in this instance, and more advanced data mining tedmologies could be used to 
identify patterns and nearest neight>our information in data (such as products from AnVil 
25 Informatics Inc and OmniViz Inc). Further, software capable of taking rule-based 
instructions (such as that described by Pacific Knowledge Systems Sydney Australia in 
their "ripple down" technology) and having the ability to self leara (heuristics and neural 
network systems) such as that described in Khan et al. Nature Medicine 7 (6) 673, 
incorporated herein by reference, could be used at this stage to limit the level of human 
30 interaction in determining a diagnosis. In this latter example, an artificial neural network 
is used, and samples are divided into training and validation sets to create trained 
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calibrated models. The calibrated models arc then used to rank genes in diagnostic 
iniportance. 

Levels of database may include: 
5 m Unique gene sequences (eg 3' and 5' EST sequence of genes) 

• Gene identity, homologous genes, tissue expression, keywords, function, cellular 
role, gene clusters, biochemical pathway, PubMed references 

• Pcimer sequences used to generate amplification products (eg two primer sequences 
used to uniquely amplify the gene for gamma interferon in a particular species) 

10 ♦ Microairay construction and fonnat (eg coded information on array manufacture 
batcb and identibScation of genes and position on the array) 

• Blood profile and clinical data associated with particular condiiions (eg standard 
clinical information and IDEXX-machine generated blood profile data) 

• Array data for normal and apparently normal status (eg 95% median range for 
15 normal animals) 

• Array data for indudble disease and disease models 

• Array data for various overt diseases (eg joint inflammation) 

• Array data for stages of various overt diseases (eg pre-dinical, dinical and 
recovery stages) 

• Array data for the influence of various classes of drags, legal or otherwise, of 
known administmtion and dose, or unknown administration or dose (eg- various 
steroids) 

• Array data for the re^onse to known and various levels of drags used as a therapy 
(eg various anti-inflammatory medication at specific doses for a specific condition) 

25 • Anay data for the response to exerdse and various training regimes 

• Array data for the response to nutrition and various feeding regimes 

• Array data for the re^nse to the environment so as to possibly deiennine 
influence of duriqg various seasons, or allergens or feed types. 

30 Each successive level relies on at least one previous level of database to allow for 
inteipretation. The database may be built over time and more intensive searching of the 
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database may iDcur a greater cost. As ihe database grows, changes may be made to the 
above methodology to increase the sensitivity of the detection of variation in expression of 
condition-specific genes - this could include the use of condition-specific arrays or 
condition-specific primers. Condition-specific airays can be manufactured by a company 
5 such as Affymetrix (under instructions) that would allow for increased sensiidvity and 
specificity, much reduced size of arrays, decreased cost of production, and the ability to 
process multiple samples at once. The process of building the database is iterative, such 
that specific genes are correlated to specific conditions, and the detection of variations in 
these genes becomes more sensitive and ^dfic through the use of various modifying 
10 processes through the procedure (eg. the use of gene-spedfic primers for the amplification 
and labelling of cDNA from RNA, and the selection of limited numbers of genes on a 
disease- or condition-specific array, detection of splice variants and single nucleotide 
polymorphisms). 



15 STEP? 
Standardised Electronic Reporting 

The database reports back electronically to a remote user, cither automatically or with a 
level of human intervention. The electronic report may be converted to a printed 
document. The repon provides details of an animal ^s condition that is determined by 
20 correlation of gene <»cpresston data with in£onnation stored in a remote database^ and 
optionally expert analj^is. 

Information sent might indude: 

- Individual genes up-rc^gulated or down-regulated (for example, with laminitis or 

25 joint capsule inflammation or bursitis, a report on the up-rcgulation of genes such as 
interleuIdn-3, manganese superoxide dismutase, Groot, mctalloproteinasc matix- 
metallo-elastasc, ferritin hght chain may have some correlation to tissue inflammation, 
and down-regulation of genes such as insulin-Uke growth factor and its receptor may 
be correlated to recovery from such a condition). The identity of these genes cannot be 

50 predicted to be associated to any condition unless the above described methodology is 
used and databases on relative expression of genes for particular conditions have been 
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compiled. Therefore a screening test covering all genes may need to be pexformed first 
and a second, more specific test then zppiied. 

• The overall pattern of ^ne expression and any correlation to particular conditions. 
For example, animals in heavy training may have a gene "fingeiprint" that is different 

S to animals being spelled from training. 

• Individual pattern of gene expression (ie. the shape of the gene expression pattern 
over a time course or multiple samples taken over a period may change as an animal 
recovers from a condition) 

• Changes to a pattern of gene expression* gene ^pression profile or level for a 
10 sing^ animal over a time period or f6r successive tests. 

• Ousters of genes up-regulated or down-regulated in a particular condition 

• Pathways of genes up-regulated or down-regulated in a particular condition 

• CoTxeJations between g/mes up-regulated or down-regulated and known conditions, 
or stage of condition, or influence 

15 • Known therapies to ameliorate the condition or enhance desired effects 

• Spedalist pathologist written interpretation 

• Relevant information of use to veterinarians, medical piactitioners, owners^ trainers 
and athletes 

• Collections of data on groups of animals under q>ccific management regimes 

20 

Throughout the specification the aim has been to describe the preferred embodiments of 
the invention without linuting the invention to any one embodiment or specific collection 
of features. It would therefore be appreciated by those of skill in the art that, in light of the 
instant disclosure, various modifications and changes can be made in the particular 
25 embodiments exemplified without departing from the scope of the present invention. For 
example, the examples described hciein may be used with performance animals other than 
horse, for example human, dog and cameL 

AU references, inclusive of patents, patent applications, scientific documents and computer 
30 programs, refcired to in this specification are herein incorporated by reference in its 
entirety. 
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THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 
1) A method of determinitig the status of a subject^ the method incltidiDg: 

a) Obtaining subject data, the subject data including one or more parameter values, at 
least one of the parameters being indicative of the current biological status of the 

5 subject; 

b) Comparing the subject data to piedetermined data, the predetermined data 
including for each of a number of individuals: 

i) 'One or more values for the parameters for the respe^vc individual, at least 
some of Che individuals having a number of conditions relevant to the status of 

10 the individual, the number of parameters being statistically sufficient to 

distinguish eadi of the conditions; and, 

ii) An indication of the status of the respective individual; and, 

c) Determining the status of the subject in accordance with the results of the 
comparison. 

15 2) A method according to claim 1, the number of parameters being greater than 100. 

3) A method according to daim 1, the number of parameters beirig greater than 1000. 

4) A method according to claim 1, the number of parameters beii^ less than 6000. 

5) A method according to claim 1, the method including determining any conditions 
di^layed by the user. 

20 6) A method according to claim S, the method induding determining the ability of the 
subject to perform in a spoiting and/or radng event in accordance with any determined 
conditions. 

7) A method according to daim 1, the method of performing the comparison induding 
causing the second processing system to: 

25 a) Obtain a set of templates, the set of templates representing differraces between 
groups of individuals; and» 
b) Use the templates to dasstfy the subject data into a respective one of the groups. 

8) A method according to daim 7, the method induding determining one or more 
conditions in accordance with the determined groups. 

30 9) A method according to claim 1, the parameters being representative of the level or 
abundance of a molecule selected from one or more of : 
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a) A polynucleotide; 

b) A polypeptide; and, 

c) A polysaccharide. 

10) A method according to daim 1, the method including: 

5 a) Receiving confinnation of the determined status; and, 

b) Updating the predetermined data in accoitiance with the confirmed status and the 
subject data. 

11) A method according to claim 1, the predetermined data including phcnoiypic 
information of the individuals, and the subject data including phenotypic information 

10 regarding the subject, the ph»K>typic information including details of one or more 
phenotypic traits. 

12) A method according to claim 11, the method including comparing the subject data to 
predetermined data for individuals having one or more phenotypic traits in conmion 
with the subject. 

15 13) Apparatus for determining the status of a subject, the ^paratus including a processing 
system adapted to: 

a) Obtain subject data, the subfect data including one or more paramet^ values, at 
least one of the parameters being indicative of the current biological status of the 
subfect; 

20 b) Compare the subject dau to predetermined data, the predetermined data including 
for each of a number of individuals: 

x) One or more values for the parameters for the respective individual, at least 
some of the individuals having a number of conditions relevant to the status of 
the individual, the number of parameters being statistically sufficient to 
25 distinguish each of the conditions; and, 

ii) An indication of the status of the re^ective individual; and, 
c) Determine the status of the subject in accordance with the results of the 
comparison, 

14) A computer program product for determining the status of a subject, the computer 
30 program product including computer executable code which when executed on a 



■ 
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suitable processing system causes the processing system to perform the method of 
daim 1. 

15) A method of allowing a user to determine the status of a subject, the method including: 

a) Receiving subject data from the user via a communications network, the subject 
5 data including one or more parameto' values, at least one of the parameter being 

indicative of the current biological status of the subject; 

b) Causing the base station to: 

i) Compare the subject data to predetermined data, the predetermined daia 
including for each of a number of individuals: 

10 (1) One or more parameter values for the respective individual; and, 

(2) An indication of the status of each individual; and» 

ii) Determine the status of the subject in accordance with the results of the 
comparison; and, 

c) Transferring an indication of the status of the subject to the user via the 
15 communications network* 

16) A method according to daim IS, the method induding: 

a) Having the user determine the subject data using a remote end station; and, 

b) Transferring the subject data &om the end station to the base station via the 
communications network. 

20 17) A method according to daim 15» the base station including first and second processing 
systems, the method including: 

a) Tiansfciring the subject data to the first processing ^stem; 

b) Transfming the subject data to the second processing system; and, 

c) Causing the second processing system to perform the comparison. 
25 18) A method according to daim 17, the method including: 

a) Transferring the results of the comparison to the first processing system; and, 

b) Causing the first processing system to determine the status of the subject. 
19) A method according to claim 17, the method including at least one of: 

a) Transferring the subject data between the communications network and the first 
30 processing system through a first jSrewall; and. 
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b) Transferring the subject data between the first and the second processing systems 
through a second lirewalL 

20) A method according tt> claini 17, the second processing system being coupled to a 
database adapted to store the predetermined data, the method including: 

5 a) Querying the database to obtain at least selected predetermined data from the 
database; and, 

b) Compare the selected predetermined data to the subject data. 

21) A method -according to claim 17> the second processiiiig system being coupled to a 
subject database, the m^hod including storing the sul^ect data in the subject database. 

10 22) A method according to daim 15, the status including details of any conditions of the 
individuals, the method including determinins conditions displayed by the user. 
23) A method according to claim 22> the method including determining the ability of the 
subject to perform in a sporting and/or racing event in accordance with any determined 
conditions. 

15 24) A method according to daim 22, the method of performing the comparison including 
causing the second prooessii^ system to: 

a) Obtam a set of templates, the set of templates representing differences between 
groups of individuals; and, 

b) Use the templates to classify the subject data into a respective one of the groups. 
20 25)A method according to daun 24, the method including determining one or more 

conditions in accordance with the determined groups. 

26) A method according to daim IS, the parameters being representative of the level or 
abundance of a molecule selected from one or more of: 

a) A polynudeotide; 
25 b) A polypeptide; and, 

c) A polysaccharide. 

27) A method according to daim 15, the method including having the user determine the 
subject data using a secure array, the secure array having a number of features each 
located at respective position on the array, and a respective serial number, the method 

30 including causing the base station to: 

a) Determine the serial number from the subject data; 
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b) Detennine a layout indicating the position of each feature on the array; 

c) Detenniiiing the parameter values in accordance with the determined layout^ and 
the subject data. 

28) A method according to claim 15, the method including: 

S a) Receivingconfirmationof the determined status from the user; and> 

b) Updating the predetermined data tn accordance with the confirmed status and the 
subject data. 

29) A mediod according to claim 21 i the features including at least one of: 

a) An oligonucleotide; 
10 b) A peptide; and, 

c) An antibody. 

30) A method according to daim 15^ the predetennined data including phenotypic 
information of the individuals^ and the subject data including phenotypic information 
regarding the subject, the phenotypic information including details of one or more 

IS phenotypic traits. 

31) A method according to daim 17, the method including comparing the subject data to 
predetermined data for individuals having one or more phenoQi)ic traits in common 
with the subject 

32) A method according to claim IS, the method including causing the base station to: 

20 a) Determine payment information^ the payment information representing the 
provision of payment by the user; and, 

b) Perform the comparison in response to the determination of the payment 
information. 

33) A base station for determining the status of a subject, the base station induding: 

25 a) A store method for storing predetermined data, the predetermined data including 
for each of a number of individuals: 

i) One or more parameter va1ue$> at least one of the parameters being indicative of 
the current biological status of the individual; . 

ii) An indication of the status of the individual; and, 

30 b) A processing system, the processing system being adapted to: 
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i) Receive subject data from the user via a comiDumcations network^ the subject 

data including one or more parameter values; 
if) Compare the subject data to the predetermined data; 

iii) Determine the status of the subject in accordance with the results of the 
comparison; and^ 

iv) Output an indication of the status of the subject to the user via the 
communications network. 

34) A base station according to claim 33> the processii^ system being ad^ned to receive 
subject data from a remote end station adapted to determine the subject data. 

35) A base station according lo claim 33, the processii^ ^^stem includirig: 

a) A first processing system adapted U>: 

i) Receive the subject data; and 

ii) Determine the status of the subject in accordance with the results of the 
comparison; and, 

b) A second processing sysi^ adapted to: 

i) Receive the subject data firom the processing system; and^ 

ii) Perform the comparison; and, 

iii) Ttansfer the results to the first processing system* 

36) A base station according to claim 35, the base station including: 

a) A first firewall for coupling the fiist processing system to the communications 
network; and» 

b) A second firewall for coupling the first and the second processing systems. 

37) A base station according to claim 35, the procesring system being coupled a subject 
database, the processing system adapted to store the subject data in the subject 
database. 

38) A method according to claim 35, the method of performing the comparison including 
causing the second processing system to: 

a) Obtain a set of templates, the set of templates representing differences between 
groups of individuals; and, 

b) Use the templates to classify the subject data into a respective one of the groups. 
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39) A method according to claim 38, the method inclading detennimiig one or more 
conditions in accordance with the detennined groups. 

40) A base station according to claim 33, the subject data being determined using a secure 
array, the secure array having a number of features each located at respective position 

5 on the anay, and a respective serial number, the processing system being adapted to: 

a) Determine the serial number from the subject data; 

b) Determine a layout indicating the position of each feature on the array; 

c) Determining the parameter values in accordance with the determined layout, and 
the subject data. 

10 41) A base station according to claim 33, the processing system being adapted to: 

a) Receive confinnation of the determined ability; and, 

b) Update the predetermined data in accordance with the determined ability and the 
subject data. 

42) A base station according to claim 33, the predetermined data including phenotypic 
15 information of the individuals, and the subject data induding phenotypic information . 

regarding the subject, the phenotypic information including detafls of one or more 
phenotypic traits. 

43) A computer program product for determining the status of a subject, the computer 
program product includii^ computer executable code yAdch when executed on a 

20 suitable processing system causes the processing system to perform the method of 
claim 16. 

44) An end station adapted to determine the status of a subject, the end station includiqg a 
processor ad^yted to: 

a) Determine subject data from the user via a commimications netwozk, the subject 
25 data including one or more parameter values^ at least one of the parameter being 

indicative of the current biological status of the subject; 

b) Transfer the subject matter to a base station via a communicatioDs network, the 
base station being adapted to: 

i) Compare the subject data to pred^ermined data for one or more individuals^ the 
30 predetermined data including: 

(1) One or more parameter values for the respective individual; and, 
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(2) An indication of the status of each individual; and» 
ii) Determine the status of the subject in accordance with the results of the 
comparison; and, 

c) Receive an indication of the status of the subject lo the user via the 
5 communications network. 

45) A computer program product &r determining the status of a subject, the computer 
program product including computer executable code which when executed on a 
suitable processing system causes the processing system to operate as an end station 
according to claim 44. 

10 46) A method of determining the ability of a subject to perform in a sporting and/or racing 
event, the method indudiiig: 

a) Obtaining subject data» the subject data including one or more parameter values, at 
least one of the parameter being indicative of the current biological status of the 
subject; 

15 b) Comparing the subject data to predetermined data, the predetermined data 
including for each of a number of individuals: 
i) One or more parameter values for the rcq>6ctive individual; and, 
it) An indication of the status of each individual; 

c) Determining the status of the subject in accordance with the results of the 
20 comparison; and» 

d) Providing an indication of the ability in accordance with the results of the 
comparison. 

47) A method according to claim 46, the status of eadi individual indicating any conditions 
displayed by the user» the method including: 

25 a) Detennim'ng any conditions displayed by tiic user in accordance with the i^ults of 
the comparison; and, 
b) Determining the ability in accordance with the determined conditions. 

48) Apparatus for determining the ability of a subject to perform in a sporting and/or racing 
event, the apparatus including a processing system adapted to: 
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a) Obtain subject data, the subject data including one or more parameter values, at 
least one of the parameter being indicative of the current biological status of the 
subject; 

b) Compare the subject data to predetermined data, the pi^deteimined data including 
5 for each of a number of individuals: 

i) One or more parameter values for the respective individual; and, 

ii) An indication of the status of each individual; 

c) Determine the status of the subject in accordance with the results of the 
comparison; and^ 

10 d) Provide an indication of the ability in accordance with the results of die 
comparison. 

49) A computer program produa for determining the ability of a subject to perform in a 
sporting and/or racing event, the computer program product Including computer 
executable code which when executed on a suitable processing system causes the 

IS processing system to perform the method of daim 46. 

50) A method of providing secure arrays for use, each aixay including a number of 
predetermined features, the method including: 

a) Detcrmim'ng a number of respe<^vc fieature laycnits, each Jayout representing the 
positioning of each feature on a respective arra^ 
20 b) Determining a number of serial numbers, each serial number corresponding to a 
respective layout; 

c) Generating a number of arrays, each array being generated in accordance with a 
respective layout, and including the corresponding serial number thereon, the serial 
number being used in processing used the anay. 
25 51) A method according to daim 50, the method being performed to provide the arrays on 
behalf of an entity, the method including providing an indication of the layouts and 
corrcq)onding serial numbers to the entity, to thereby allow the entity to process the 
arrays. 

52) A method according to claim 50, the mcOiod of determining the layouts including: 
30 a) Determining a preferred layout; and. 
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b) Moving the position of one or more of the features from the position in the 
preferred layout to alteniative position. v 
53) A method according to claim 52, the method including: 
a) Detenmning the type of each feature; and, 

b> E'f<*ai«iiigihepositionofoneormoicfeatureshavingdxfreTentfeaturety^^ 
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